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METHODS FOR ALTERING THE EXPRESSION OF HYPHAL-SPECIFIC GENES 

5 

CROSS REFERENCE TO RELATED APPLICATIONS 

The present invention is related to and, in accordance with the provisions of 35 U.S.C. 
§ 120, claims the benefit of provisional patent apphcation Serial No. 60/167,672, filed 29 
November 1 999, which is expressly incorporated fully herein by reference. 

10 

FIELD OF THE INVENTION 

The present invention relates to methodologies and molecular targets for the 
prevention and treatment of microbial infection of a mammalian host by altering the 
transcription of hyphal-specific genes. Preferably, these methods and molecular targets may 
15 be used in the prevention or treatment of microbial infection of mammalian hosts, such as in 
immunocompromised patients at risk for opportunistic fungal infections, organ transplant 
patients, cancer patients undergoing chemotherapy, bum patients, AIDS patients, or patients 
with diabetic ketoacidosis. 

20 BACKGROUND OF THE INVENTION 

Fungi are not only important human and animal pathogens, but they are also among 
the most common causes of plant disease. Fungal infections (mycoses) are becoming a major 
concem for a number of reasons, including the limited number of antifungal agents available, 
the increasing incidence of species resistant to known antifungal agents^ and the growing 

25 population of immimocompromised patients at risk for opportunistic fungal infections, such 
as organ transplant patients, cancer patients undergoing chemotherapy, bum patients, AIDS 
patients, or patients with diabetic ketoacidosis. The most common clinical isolate is Candida 
albicans (comprising about 19% of all isolates). In one study, nearly 40% of all deaths from 
hospital-acquired infections were due to fungi. Sternberg, 266(5191) SCIENCE 1632-34 

30 (1994). 

The yeast Candida albicans (C albicans) is one of the most pervasive fungal 
pathogens in humans. It has the capacity to opportunistically infect a diverse spectrum of 
compromised hosts, and to invade many diverse tissues in the himian body. It can in many 



instances evade antibiotic treatment and the immune system. Although C albicans is a 
member of the normal flora of the mucous membranes in the respiratory, gastrointestinal, and 
female genital tracts, in such locations it may gain dominance and be associated with 
pathologic conditions. Sometimes it produces progressive systemic disease in debilitated or 
5 immunosuppressed patients, particularly if cell-mediated immunity is impaired. Sepsis may 
occur in patients with compromised cellular inmiunity, e.g., those undergoing cancer 
chemotherapy or those with lymphoma, AIDS, or other conditions. Candida may produce 
bloodstream invasion, thrombophlebitis, endocarditis, or infection of the eyes and virtually 
any organ or tissue when introduced intravenously, e.g., via tubing, needles, narcotics abuse, 
10 etc. 

C albicans has been shown to be diploid with balanced lethals, and therefore 
probably does not go through a sexual phase or meiotic cycle. This yeast appears to be able 
to spontaneously and reversibly switch at high frequency between at least seven general 
phenotypes. Switching has been shown to occur not only in standard laboratory strains, but 

15 also in strains isolated from the mouths of healthy individuals. 

Oropharyngeal and esophageal candidiasis are among the most frequent opportunistic 
fiingal infections observed in human immunodeficiency virus positive (HrV+) and AIDS 
patients, occurring in the majority of patients. The pathogenesis is complex and is thought to 
involve multiple host factors that include loss of cell mediated immunity and altered 

20 phagocytic cell activity. The current status of the AIDS epidemic is one of increasing 

numbers of individuals infected and no cure. Many infected individuals may live for a long 
time with HIV in an essentially permanent immunocompromised state. Because of the loss 
of the cellular component of the immune system, AIDS patients are susceptible to invasion of 
submucosal tissue by C. albicans. The frequency of candidal infections may also be a result 

25 of the prophylactic use of antibacterial drugs used in AIDS patients to minimize other 

opportunistic infections. Candidal infections increase in severity and recur more frequently 
as the immunodeficiency progresses. Although C. albicans is sensitive to antifungal drugs, 
treatment over long periods of time are required, and isolates from HIV infected patients may 
be more resistant that other isolates. 

30 In addition to HIV affected patients, oral candidiasis occurs in cancer patients, e.g., 

leukemia patients, as well as in patients with other underlying diseases. The most commonly 
associated disease with oral candidiasis, however, is denture stomatites, which is generally 
observed in denture wearers. 
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A feature of C. albicans growth that is correlated with pathogenicity in the oral cavity 
is the ability to transform from budding to filament-extending growth. Filamentous forms 
adhere more readily to buccal epithelial cells (BECs) than budding yeasts, and histologically 
are a prominent feature of invasion of the mucosa. Alternatively, in mucosal and systemic 
5 disease, C. albicans exists as a polymorphic set of growth forms termed yeasts, 

pseudohyphae and true hyphae. In mucosal disease, filamentous forms, particularly true 
hyphae, invade the keratinized layer of differentiated, stratified squamous epithelium. True 
hyphae are septate, cylindrical structures with parallel sides that are formed by extension of 
germ tubes which emerge fi-om yeasts in appropriate environmental conditions. Therefore, 

10 knowledge of the molecular events that transform C albicans to the pathogenic filamentous 
form as well as detailed investigations of the hyphal surface at the molecular level are 
necessary for understanding the pathogenesis of candidiasis. 

If left untreated, Candida infections firequently lead to the death of the patients. 
Nystatin, ketoconazole, and amphotericin B are drugs which are used to treat oral and 

1 5 systemic Candida infections. However, orally administered nystatin is limited to treatment 
within the gut and is not applicable to systemic treatment. Some systemic infections are 
susceptible to treatment with ketoconazole or amphotericin B, but these drugs may not be 
effective in such treatment unless combined with additional drugs. Amphotericin B has a 
relatively narrow therapeutic index and numerous undesireable side effects and toxicities 

20 occur even at therapeutic concentrations. While ketoconazole and other azole antifungals 
exhibit significantly lower toxicity, their mechanism of action, inactivation of cytochrome 
P450 prosthetic group in certain enzymes (some of which are found in humans) precludes use 
in patients that are simultaneously receiving other drugs that are metabolized by the body's 
cytochrome P450 enzymes, 5ee, e.g., U.S. Pat, No. 5,863,762. 

25 Other knovm antifungal agents include: polyene derivatives, such as amphotericin B 

(including lipid or liposomal formulations thereof) and the structurally related compounds 
nystatin and pimaricin; flucytosine (5-fluorocytosine); azole derivatives (including 
ketoconazole, clotrimazole, miconazole, econazole, butoconazole, oxiconazole, sulconazole, 
tioconazole, terconazole, fluconazole, itraconazole, voriconazole [Pfizer] and SCH56592 

30 [Schering-Plough]); allylamines-thiocarbamates (including tolnaftate, naftifine and 
terbinafine); griseofulvin; ciclopirox; haloprogin; echinocandins (including MK-0991 
[Merck]); nikkomycins; and, bactericidal/permeability-increasing protein (BPI), described in 
U.S. Pat. Nos. 5,627,153; 5,858,974; 5,652,332; 5,763,567; and 5,733,872. 
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Resistance of bacteria and other pathogenic organisms to antimicrobial agents is an 
increasingly troublesome problem. The accelerating development of antibiotic-resistant 
bacteria, intensified by the widespread use of antibiotics in farm animals and overprescription 
of antibiotics by physicians, has been accompanied by declining research into new antibiotics 
5 with different modes of action. Travis J., 264(5 157) Science 360-374 ( 1 994). 

Indeed, there is a need for an effective treatment of opportunistic infections caused by 
C albicans for the aforementioned reasons. Therefore, one objective of the present invention 
is to provide screening assays for identifying potential inhibitors of C albicans growth. 
Another object of the present invention is to identify potential inhibitors of C. albicans 

10 growth that are based on the alteration of transcription in this organism. 

Whether pathogenic or opportunistic, microorganisms have evolved numerous 
mechanisms to facilitate their establishment and proliferation in mammalian hosts. For 
example, during initial infection, the interaction between the microorganism and the host may 
include attachment or adhesion of the microorganism to the host cell surface, invasion of the 

15 host cells by the microorganism, and an elaboration of toxins by the microorganism. Li 
certain instances, the microorganism-host cell interaction may be specific or non-specific. 
Typically, a specific microorganism-host cell interaction may involve the specific binding of 
the microorganism to a specific receptor or receptor complex expressed on the host cell 
surface. Subsequently, the binding event may trigger changes in the microorganism and/or 

20 the host cell, leading to the progression of infection. 

The role of host cell molecules involved in certain microbial interactions has been 
determined in some cases. Mammalian transglutaminases are examples for which the 
molecular mechanisms of action and/or the role in host cell growth and development have 
been elucidated. Generally, transglutaminases are enzymes that catalyze intermolecular 

25 crosslinks by the formation of highly stable isopeptide bonds between the y-carbonyl group of 
glutamine and the s-amino group of lysine residues, which are resistance to proteases, sodixim 
dodecyl sulfate, and heat. Epithelial cell transglutaminases are important for the formation of 
the comified envelopes of mature squamous epitheilial cells. 

Investigators have shown that certain microorganisms may express proteins capable 

30 of acting as substrates for mammalian transglutaminases. One example is the hypha-specific 
hyphal wall protein 1 (HWPl) of C albicans. HWPl is a developmentally-regulated cell 
wall protein that is expressed exclusively in true hypha in cultures and in animal tissues. 
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Staab et al., 283(5407) Science 1535-38 (1999); Staab et ah, 271(1 1) J. Biol. Chem. 6298- 
305 (1996); Staab & Sundstrom, 14(7) Yeast 681-86 (1998). 

HWPl consists of an N-terminal proline and glutamine-rich repetitive amino acid 
sequence, which is exposed on the hyphal surface, and a cell wall-anchored serine and 
5 threonine rich C-terminus. The composition of the N-terminal amino acid repeat is 

reminiscent of mammalian transglutaminase substrates. HWPl has a profound effect on 
adherence of C albicans to BECs through a novel mechanism that involves host enzymes. 
Indeed, HWPl may serve as a substrate in transglutaminase-mediated cross-linking reactions, 
thereby forming isopeptide bonds to proteins on the surfaces of host cells. Furthermore, 
10 HWPl encodes an uncoventional adhesin that is necessary for the pathogenesis of 
candidiasis. 

Therefore, by defining the molecular events leading to the expression of a 
pathogenically important adhesin, and through the identification of new genes that are co- 
regulated with HWPl in a putative global regulatory circuit, the present invention has strong 

1 5 potential for identifying new and novel ways to interfere with candidiasis. The long term 

medical benefits of the present invention maybe the development of alternative or adjimctive 
therapies based on new knowledge about expression of hypha-specific genes in C. albicans. 

Accordingly, an objective of the present invention includes identifying and 
characterizing the 5' and 3' sequences flanking the HWPl structutal gene. An additional 

20 objective of the present invention includes the indentification and characterization of DNA 
binding proteins (DNA BP) that regulate transcription of HWPl . 

Other objects, features and advantages of the present invention will become apparent 
fi-om the following detailed description. The detailed description and the specific examples, 
however, indicate only preferred embodiments of the invention. Various changes and 

25 modifications within the spirit and scope of the invention will become apparent to those 
skilled in the art firom this detailed description. 

SUMMARY OF THE INVENTION 

A specific embodiment of the present invention relates to a method for interfering 
30 with the expression of hyphal-specific genes in a fungus, which results in the inhibition of the 
grov^h of the fimgus. This particular embodiment of the present invention is accomplished 
by interfering with the transcription of hyphal-specific genes mediated by cis acting 
sequences. In a particular embodiment, the fiingus comprises a pathogenic or nonpathogenic 
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yeast strain. Specifically, the pathogenic fungus comprises C albicans. In a fiirther 
embodiment of the present invention, the cis acting sequences comprises cf^-regulatory 
sequences, specifically, a UAS and a URS, 

In another embodiment of the present invention, the interfering step comprises 
5 interfering with DNA BP that bind to the c/5-regulatory elements. Further, the interfering 
step may comprise interfering with the regulatory proteins specific to the DNA BP. Further 
yet, the interfering step may comprise manipulating factors that interfere with the 
transcription of the hyphal-specific genes. Finally, the interfering step may comprise 
manipulating the signal transduction pathways that influence the transcription of the hyphal- 

10 specific genes. 

In an alternative embodiment, the present invention contemplates a method of 
interfering with the DNA BP by manipulating the binding of the DNA BP to the cis- 
regulatory elements. Further, this embodiment may comprise interfering with the expression 
of the DNA BP or by altering the ability of the DNA BP to bind to the target c/^-regulatory 

15 element. Further yet, the c/s-regulatory elements comprise the NIT2 binding site. 

In an additional embodiment of the present invention, the DNA BP is encoded by a 
nucleotide sequence for the DNA binding domain that is homologous to a nucleotide 
sequence that encodes the DNA binding domain of the NIT2 binding proteins. Specifically, 
the NIT2 binding domain comprises the protein sequence listed in SEQ. ID. NOS: 2 or 3. 

20 The DNA BP may be selected fi-om the group consisting of GAT99, GATl, and GATA-like 
binding proteins. 

In a further embodiment of the present invention, manipulating environmental factors. 

Regulation of the environmental factors may comprise regulating the environmental 

temperature of the fungus and/or modifying the nitrogen available to the fimgus. A further 
25 embodiment of the present invention comprises manipulating the signal transduction 

pathways that influence the transcription of the hyphal-specific genes. Specifically, the 

signal transduction pathway comprises the cAMP-dependant signalling pathway. 

In a particular embodiment of the present invention, the hyphal-specific genes may be 

selected from the group consisting ofHYRl, ECEl,ALS3, CHS2, and SAP6. Specifically, 
30 the hyphal-specific gene of the present invention comprises HWP\ . In a further embodiment 

of the present invention, the 5' and 3' flanking sequences have been determined. 

Specifically, the 5' flanking sequence of HWPl comprises the sequence listed in SEQ. ID. 

NO: 1 and the 3' flanking region comprises the sequence listed in SEQ. ID. NO: 4. Further, 
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the hyphal-specific genes comprises genes responsible for controlling dimorphism. In 
particular the hyphal-specific gene, HWPl encodes an unconventional adhesin that may be 
essential for adhesion of the fungus to the patient. 

In yet a further embodiment of the present invention, the patients may be 
5 immunocompromised and at risk for opportunistic fungal infections. Li particular the patient 
may be, but is not limited to, an organ transplant recipient, a cancer patient undergoing 
chemotherapy, a bum patient, an AIDS patients, or a patient with diabetic ketoacidosis. 

In a final embodiment, the present invention provides a methodology for 
characterizing genes under the control of DNA BP in a fungus. This embodiment is 
10 accomplished by creating a genomic library isolated fi-om a fimgus, specifically C albicans^ 
screening the genomic library, and isolating and sequencing the resultant clones. 



BRIEF DESCRIPTION OF THE DRAWINGS 



15 FIGURE 1 illustrates the pHWPlURAS plasmid containing an interrupted HWPl gene. The 
plasmid was constructed by first isolating two genomic BamHI fi-agments containing HWPl 
half clones from a genomic library constructed in Lambda GEM- 12. The BamHI fragments 
were then cloned in pBluescriptSK' (Stratagene, La Jolla, CA) to generate a LO kb insert and 
a 3.0 kb insert. An uninterrupted HWPl gene was created by ligating the 3.0 kb insert to 

20 create pGBHWPl . An inactive HWPl gene was created by replacing HWPl DNA between 
the Bel I and Bgl D sites (394 bp) with the 4.0 kb hisG-URA3-hisG cassette. The segments 
from A to B and E to F are HWPl DNA; the segment from B to C and D to E are the hisG 
repeats; and the segment from C to D is URA3. Digestion with Hind III released the 
disruption cassette from the vector sequences. 

25 

FIGURE 2 illustrates the role of HWPl in health of mice orally colonized with C albicans. 
Gnotobiotic nude mice that received C. albicans strains with HWPl were more prone to 
failure to thrive than mice that received the C albicans hwpl null mutant strain. Only two of 
seven beige nude mice given the -containing heterozygous C albicans strain and one 

30 of four mice given the HWPl revertant remained healthy for six weeks. Only two of 1 1 mice 
that received the hwpl null mutant strain of C albicans became ill. The survival difference 
between groups of mice that received the HWPl heterozygote strain CAH7 compared to the 
hwpl null mutant was statistically significant ( P < 0.05). The combined groups of mice that 
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received the HWPl containing heterozygous and revertant, also showed a significant 
difference in survival from mice receiving the hwpl null mutant strain. The absence of a 
significant difference in the survival of mice that received the revertant strain and the hwpl 
null mutant (P = 0.058) was because one of the mice was sacrificed for "comparison" prior 
5 to becoming ill. When this mouse is removed fi-om the analysis, the survival difference 

between null and revertant is significant. The survival of Epsilon 26 mice also depended on 
the presence of HWPI in C albicans strains used for monoassociation. All mice that 
received //fFF7 -containing heterozygous C. albicans strains and four of five mice given the 
HWPI revertant strain became ill. Only one mouse that received the revertant strain 
10 remained healthy for seven weeks, hi contrast, none of the five mice that received the hwpl 
null mutant strains of C. albicans became ill. The survival differences between groups of 
mice that received the HWPI heterozygote and revertant strains compared to the hwpl null 
mutant were statistically significant ( P < 0.01). 

1 5 FIGURE 3 depicts the sequence of the 5 " flanking region of HWPI . The upstream region of 
HWPI containing the regulatory sequences has been isolated as a 1.6 Bgl II fragment of 
C. albicans genomic DNA. The cloned HWPI promoter region includes 1.467 kb upstream 
of the ATG start codon of the structural HWPI gene. A putative TATA box was found 67 bp 
upstream form the beginning of the transcript. 

20 

FIGURE 4 illustrates the features of the clones 5' flanking region of the HWPI gene [SEQ. 
ID. NO: 1]. A 10 nucleotide perfect direct repeat was foxmd at positions 554 and 567, A 
TATA box was found 67 bp upstream from the begiiming of the transcript. A search using 
the Matlnspector v2.2 program revealed the presence of 16 NIT2 sites, 2 PH04 sites, 1 
25 MATAl site, and 1 HSF site within the 1,467 bp 5' flanking region. The position and 
direction of the NIT2 sites are shown with — ► below the sequence, the PH04 sites are 

designated by ► , and the CRE site within the PH04 site is shown by ► . A notable 

feature is a pair of NIT2 sites in opposite orientations near position 800 that form an almost 
perfect palindrome on opposite DNA strands. 

30 

FIGURE 5 illustrates the comparison between the GAT99 50-residue DNA binding domain 
[SEQ. ID. NO: 2] with the NIT2 50-residue DNA binding domain [SEQ. ID. NO: 3]. A 
search of the C albicans genome revealed that GAT99 is highly homologous to NIT2. Using 
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the ALIGN program, 92% homology was observed between the two DNA binding domains. 
The asterisks (*) indicate cysteine residues involved in zinc chelation, and the diamond (♦) 
identifies the leucine residue, which when altered in AREA and NIT2 led to different changes 
in promoter recognition. 

5 

FIGURE 6 illustrates 1,441 nucleotides of the 3 'flanking region of HWPl [SEQ. ID. NO: 4]. 
Translation of HWPl ends with a UAA codon followed with 277 nucleotides of untranslated 
message. A polyadenylation site (AAUAAA) 252 nucleotides from the nucleotides from the 
stop codon, and a poly A tail of adenosines are also part of the 3' region of the message. The 
10 polyadenylation site and other sequences potentially important for 3' end formation are 

shown in bold. A truncated open reading frame (bold-italics) consisting of 178 amino acids, 
beginning with ATG and interrupted by the BamHI site at the end of the clone (underlined), 
exists at the 3' end of this region indicating the presence of an additional gene. 

1 5 FIGURE 7 depicts a Northem blot of HWPl mRNA three hours after initiation of mass 

conversion from yeasts to hyphae (strain SC5314) in M199. Each lanes contains 2.5 [ig of 
total RNA. To confirm the presence of equivalent amounts of RNA in each lane, an 1 8S 
ribosomal probe was used as a control. The blots were first probed with HWPl cDNA which 
represents he 5' region of the structural gene. The blot has been overexposed to show that 

20 HWPl mRNA is detectable within 20 minutes and at 7 hrs of placing cells under induction 
conditions (HWPl mRNA is clearly visible in the autoradiograph, but is faint in the 
reproduction). The message level increases dramatically between 20 and 30 minutes and 
continues to rise, peaking at 3-4 hours. After 4 hours, the HWPl mRNA begins to decline 
until 7 hours where it stabilizes at a low level. 

25 

FIGURE 8 illustrates the plasmid construction for the HWPl promoter studies. 1 . 
PBluescriptSK- was used as the backbone. 2. The entire region of HWPl was amplified 
using oligonucleotides that incorporated Sma I and Hind III sites to the ends. The PCR 
product was then digested with Hind III and Sma 1. 3. The digested PCR product was cloned 
30 into pBluescript digested with Hinc 11 and Hind III, creating pBS5'. 4. A GFP gene 

fragment was produced by digesting pYGFP3 with Hind III and Pst I. 5. The digested GFP 
fragment was cloned into pBS5' that had been digested with Hind III and Pst I, creating 
pl47GFP. 6. The 3' region of HWPl was ampHfied with an oligonucleotide that eliminates 



the Hind III site downstream of the mRNA end. The fragment was exposed to Pfu 
polymerase to generate blunt ends, followed by digestion with Sac 1. The HWPl 3' digested 
fragment was cloned into pl47GFP that had been digested with Sma I and Sac I, creating 
pl47GFP3\ This cloning strategy ensured that the HWPl 3' end would be inserted into the 
5 plasmid in the proper orientation. 7. Eno:URA3 was amplified with oUgonucleotides that 
added Kpn I sites at the ends. The URA3 PGR product was digested with Kpn I and cloned 
into a Kpn I digested pl47GFP3', creating pHWPl GFPl . 8. The final construct, 
pHPWlGFPl, contains 1,467 bp of HWPl upstream flanking sequences, 352 bp of 
downstream flanking sequence, and the C. albicans enolase gene disrupted with the C 
10 albicans URA3 gene. The URA3 gene was used for selection in the uraS strain CAM and to 
target one of the ENO gene loci of C albicans. 

FIGURE 9 illustrates the integration of pHWPlGFPl into the chromosome of C. albicans at 
the enolase locus, pHWPlGFPl was linearized with Clal and then used to transform the Ura" 
15 strain CAM. The linearized plasmid was targeted to the ENO locus in the genome using 
spheroplast transformation, resulting in a mutated copy of the enolase gene disrupted with 
URA3 adjacent to a wild type copy. The transformants will be selected on media lacking 
luidine. A control plasmid lacking HWPl upstream region was used as a negative control. 

20 FIGURE 10 depicts a Southern blot to show the stable integration of the HWPl 

promoter:GFP construct at the genomic enolase locus in Ura" strain CAM. Following 
spheroplast transformation, two transformants were produced, GFP1/C2 and GFP1/C3. 
Southern blotting of genomic DNA digested with Ava I confirmed that transformants C2 and 
C3 had integrated GFP into the ENO locus of C. albicans. The integrated construct produced 

25 a 7.8 kb Ava I band that was recognized by both the enolase (lanes 2 and 3) and GFP (lanes 6 
and 7) probes. The undisrupted enolase locus produced a very faint 6.4 kb band (lane 4, 
arrow). The GFP band was not detected in the parental strain CAM, lane 8. 

FIGURE 11 depicts fluorescence microscopy of the GFP-producing transformants. 
30 Fluorescence was used to demonstrate that the HWPl promoterrGFP construct is 

developmentally regulated. The C2 and C3 transformants were grown to stationary phase in 
YNB for 48 hours at room temperature before placement in Ml 99, firesh YNB at 27"*, or in 
the four Lee's media conditions (pH 4.5 and 6.8 at 27'' and 37°). The cells were examined 
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for the production of GFP in germ tubes incubated in Ml 99 and Lee's pH 6.8 at 37°. (A) 
fluorescence was not localized to the germ tubes but was found throughout the cytoplasm of 
C. albicans, (B) the appearance of fluorescence coincided with the induction of germ tube 
formation. (C) yeast did not express GFP. (E) pseudohyphae did not express GFP. Panels 
5 B, and F are controls. 

FIGURE 12 illustrates indirect immunofluorescence (IIF). IIF was used to demonstrate that 
GFP does not interfere with HWPl expression using GFP-producing transformants. 
Secondary anti-rabbit antibodies conjugated with Texas Red rather were used so that Hwp\ 
10 could be detected in the presence of GFP. Panel A shows the GFP-producing transformants. 
Panel B shows Hwpl of the GFP-producing transformants stained with Texas red. 

FIGURE 13 illustrates flow c5^ometry to quantify the production of GFP within the C2 and 
C3 stable tranasformants. The graph in panel B illustrates a three log shift in fluorescence of 
1 5 germ tube compared with the negative control in panel A. Yeast cells that did not germinate 
were used as the negative control in panel A and as illustrated are non-fluorescent. 

FIGURE 14 illustrates the nested set of deletions created to identify the location of cz5-acting 
regulatory sequences upstream of HWPl . The sequential deletions of DNA were performed 

20 by PGR with oligonucleotides to sequences progressively closer to the -1 site. The same 3' 
oligonucleotides (shown as black vertical lines I at fragement ends) used to generate the 
original plamsid construct (pHWPlGFPl) were used. The oligonucleotides for generating 
the nested deletions are shown in panel A [SEQ. ED. NOS: 5-1 1]. The bolded sequence is the 
Xho I restriction enzyme site. Panel B shows the expected PGR products using the 

25 oligonucleotides to generate the sequential deletions of 5' HWPl . The top linear map shows 
the location of the NIT2 pairs below the sequence as •. The location of the TATA box is also 
indicated. The effect of the NIT2 binding sites will be detected in A608, A856, and A1271. 
The second map shows the oligonucleotide binding sites as ■. The smallest PGR product 
will have 83 bp ofDNA upstream from the TAT box. This construct will serve as a minimal 

30 promoter reporter for testing activating functions of regulatory elements. 
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FIGURE 15 illustrates the mini-array-based, transcription analysis of C. albicans genomic 
library to identify and characterize genes under control of DNA binding proteins. This 
method is more sensitive than differential screening. 

5 

DETAILED DESCRIPTION OF THE INVENTION 

It is understood that the present invention is not limited to the particular methodology, 
protocols, cell lines, vectors, and reagents, etc., described herein, as these may vary. It is also 
to be understood that the terminology used herein is used for the purpose of describing 

10 particular embodiments only, and is not intended to limit the scope of the present invention. 
It must be noted that as used herein and in the appended claims, the singular forms "a," "an," 
and "the" include plural reference unless the context clearly dictates otherwise. Thus, for 
example, a reference to "a DNA BP" is a reference to one or more DNA BPs and equivalents 
thereof known to those skilled in the art and so forth. 

15 Unless defined otherwise, all technical and scientific terms used herein have the same 

meanings as commonly understood by one of ordinary skill in the art to which this invention 
belongs. Preferred methods, devices, and materials are described, although any methods and 
materials similar or equivalent to those described herein can be used in the practice or testing 
of the present invention. All references cited herein are incorporated by reference herein in 

20 their entirety. 

DeHnitions 

Promoter: a recognition site on a DNA sequence or group of DNA sequences that provide an 
25 expression control element for a gene and to which RNA polymerase specifically binds and 
initiates RNA synthesis (transcription) of that gene. 

Inducible promoter: a promoter where the rate of RNA polymerase binding and initiation is 
modulated by extemal stimuK. Such stimuli include light, heat, anaerobic stress, alteration in 
30 nutrient conditions, presence or absence of a metabolite, presence of a ligand, microbial 
attack, wounding and the like. 
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Viral promoter: a promoter with a DNA sequence substantially similar to the promoter 
found at the 5* end of a viral gene. A typical viral promoter is found at the 5' end of the gene 
coding for the p2I protein of MMTV described by Huang et al., 27(2 Pt 1) CELL 245-55 
(1981). 

5 

Synthetic promoter: a promoter that was chemically synthesized rather than biologically 
derived. Usually synthetic promoters incorporate sequence changes that optimize the 
efficiency of RNA polymerase initiation. 

1 0 Constitutive promoter: a promoter where the rate of RNA polymerase binding and initiation 
is approximately constant and relatively independent of extemal stimuli. 

Heterologous Polypeptide: a Hnear series of amino acid residues connected one to the other 
by peptide bonds between the alpha-amino and carboxy groups of adjacent residues 
1 5 originating from a species other than the plant host system within which said linear series is 
produced. "Polypeptide" also encompasses a sequence of amino acids, peptides, fragments of 
polypeptides, proteins, globular proteins, glycoproteins, and fragments of these. 

Multimeric protein: a protein containing more than one separate polypeptide or protein 
20 chain, each associated with the other to form a single protein. Both heterodimeric and 
homodimeric proteins are multimeric proteins. 

Immunoglobulin: a polypeptide, protein or multimeric protein containing at least the 
immunologically active portion of an immunoglobulin heavy chain and is thus capable of 
25 specifically combining with an antigen. Exemplary immunoglobulins are immunoglobulin 
heavy chains, immunoglobulin molecules, substantially intact immunoglobulin molecules, 
any portion of an immunoglobuUn that contains the paratope, including those portions known 
in the art as Fab fragments. Fab' fragment, F(ab^).sub.2 fragment and Fv fragment. 

30 Recombinant: as used herein, broadly describes various technologies whereby genes can be 
cloned, DNA can be sequenced, and protein products can be produced. As used herein, the 
term also describes proteins that have been produced following the transfer of genes into the 
cells of host systems. 
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Fusion protein: a protein in which peptide sequences from different proteins are covalently 
linked together. 

5 Hybridization: broadly defined, any process by which a nucleic acid sequence binds to a 
complementary sequence through base pairing. Hybridization conditions can be defined by, 
for example, the concentrations of salt or formamide in the prehybridization and 
hybridization solutions, or by the hybridization temperature, and are well known in the art. 
Hybridization can occur under conditions of various stringency, hi particular, stringency can 

1 0 be increased by reducing the concentration of salt, increasing the concentration of formamide, 
or raising the hybridization temperature. For example, hybridization under high stringency 
conditions could occur in about 50% formamide at about 3TC to 42°C. Hybridization could 
occur under reduced stringency conditions in about 35% to 25%o formamide at about SO^'C to 
35°C. hi particular, hybridization could occur under high stringency conditions at 42°C in 

15 50%> formamide, 5 times SSPE, 03%> SDS, and 200 ug/ml sheared and denatured salmon 
sperm DNA. Hybridization could occur under reduced stringency conditions as described 
above, but in 35%> formamide at a reduced temperature of 35°C. The temperature range 
corresponding to a particular level of stringency can be fiirther narrowed by calculating the 
purine to pyrimidine ratio of the nucleic acid of interest and adjusting the temperature 

20 accordingly. To remove nonspecific signals, blots can be sequentially washed, for example, 
at room temperature under increasingly stringent conditions of up to 0. 1 X saline sodium 
citrate and 0.5% sodium dodecyl sulfate. Variations on the above ranges and conditions are 
well known in the art. 

25 Isolated: as used herein, refers to any element or compound separated not only from other 

elements or compounds that are present in the natural source of the element or compound, but 
also from other elements or compounds and, as used herein, preferably refers to an element or 
compound found in the presence of (if anything) only a solvent, buffer, ion, or other 
component normally present in a solution of the same. 

30 

Nucleic acid sequences: as the term is used herein, nucleic acid sequences encoding a 
hyphal-specific gene or fimctional equivalent thereof including those with deletions. 
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insertions, or substitutions of different nucleotides resulting in a polynucleotide that encodes 
the same or a functionally equivalent or hyphal-specific protein. Included within this 
definition are polymorphisms which may or may not be readily detectable using a particular 
oligonucleotide probe of the polynucleotide encoding a protein and improper or unexpected 
5 hybridization to alleles, with a locus other than the normal chromosomal locus for the 
polynucleotide sequence encoding a heterologous polypeptide. The encoded protein may 
also be "altered" and contain deletions, insertions, or substitutions of amino acid residues 
which produce a silent change and result in a functionally equivalent protein. Deliberate 
amino acid substitutions maybe made on the basis of similarity in polarity, charge, solubility, 
10 hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues as long as the 
biological or immunological activity of a protein is retained. For example, negatively 
p charged amino acids may include aspartic acid and glutamic acid; positively charged amino 
~t acids may include lysine and arginine; and amino acids with uncharged polar head groups 

having similar hydrophilicity values may include leucine, isoleucine, and valine, glycine and 
p 15 alanine, asparagine and glutamine, serine and threonine, and phenylalanine and tyrosine, 
p. The term "nucleic acid sequence," includes an oligonucleotide, nucleotide, or 

- polynucleotide, and fragments thereof, and to DNA or RNA of genomic or synthetic origin 

which may be single- or double-stranded, and represent the sense or antisense strand, to 
^ peptide nucleic acid (PNA), or to any DNA-like or RNA-like material, natural or synthetic in 

p20 origin. "Fragments" include nucleic acid sequences which are greater than about 60 

nucleotides than in length, and most preferably includes fragments that are at least about 100 
nucleotides, at least about 1000 nucleotides, and at least about 10,000 nucleotides in length. 

Antisense gene: an antisense gene is constructed by reversing the orientation of the gene with 
25 respect to its promoter so that the antisense strand is transcribed. 

Antisense RNA: an RNA molecule complementary to a particular RNA transcript that can 
hybridize to the transcript and block its function. 

30 Amino acid sequences: as used herein, this term includes an oligopeptide, peptide, 
polypeptide, or protein sequence, and fragment thereof, and to naturally occurring or 
synthetic molecules. 
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Fragments: include any portion of an amino acid sequence which retains at least one 
structural or functional characteristic of the subject heterologous polypeptides. 



Derivative: as used herein, includes the chemical modification of a nucleic acid encoding or 
5 complementary to a hyphal-specific protein polypeptides of the present invention. Such 
modifications include, for example, replacement of hydrogen by an alkyl, acyl, or amino 
group. A nucleic acid derivative encodes a polypeptide which retains the biological or 
immunological function of the natural molecule. A derivative polypeptide is one which is 
modified by glycosylation, pegylation, or any similar process which retains the biological or 
10 immunological function of the polypeptide firom which it was derived. 

Chemical derivative: as used herein, a molecule is said to be a "chemical derivative" of 
another molecule when it contains additional chemical moieties not normally a part of the 
molecule. Such moieties can improve the molecule's solubility, absorption, biological half- 
15 life, and the like. The moieties can alternatively decrease the toxicity of the molecule, 
eliminate or attenuate any undesirable side effect of the molecule, and the like. 

Complementary or complementarity: as used herein, include the natural binding of 
polynucleotides under permissive salt and temperature conditions by base-pairing. For 

20 example, the sequence "A-G-T" binds to the complementary sequence "T-C-A." 

Complementarity between two single-stranded molecules may be "partial," in which only 
some of the nucleic acids bind, or it may be complete when total complementarity exists 
between the single stranded molecules. The degree of complementarity between nucleic acid 
strands has significant effects on the efficiency and strength of hybridization between nucleic 

25 acid strands. This is of particular importance in amplification reactions, which depend upon 
binding between nucleic acids strands and in the design and use of molecules. 

Deletion: as used herein, refers to a change in the amino acid or nucleotide sequence and 
results in the absence of one or more amino acid residues or nucleotides. 

30 

Insertion or addition: as used herein, includes a change in an amino acid or nucleotide 
sequence resulting in the addition of one or more amino acid residues or nucleotides, 
respectively, as compared to the naturally occurring molecule. 
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Introduction: insertion of a nucleic acid sequence into a cell, by methods including 
infection, transfection, transformation or transduction. 

5 Transfection: as used herein includes the process of introducing a DNA expression vector 
into a cell. Various methods of transfection are possible including microinjection or 
lipofection. 

Transformation: a process by which exogenous DNA enters and changes a recipient cell. It 
10 may occur imder natural or artificial conditions using various methods well known in the art. 
Transformation may rely on any known method for the insertion of foreign nucleic acid 
sequences into a prokaryotic or eukaryotic host cell. The method is selected based on the 
type of host cell being transformed and may include, but is not limited to, viral infection, 
electroporation, heat shock, and lipofection. 

15 

Functional equivalent: a protein or nucleic acid molecule that possesses functional or 
structural characteristics that are substantially similar to a heterologous protein, polypeptide, 
enzyme, or nucleic acid. A functional equivalent of a protein may contain modifications 
depending on the necessity of such modifications for the performance of a specific function. 
20 The term "functional equivalent" is intended to include the "fragments,'* "mutants," 
"hybrids," "variants," "analogs," or "chemical derivatives" of a molecule. 

Variant: an amino acid sequence that is altered by one or more amino acids. The variant 
may have "conservative" changes, wherein a substituted amino acid has similar structural or 

25 chemical properties, e.g., replacement of leucine with isoleucine. More rarely, a variant may 
have "nonconservative" changes, e.g., replacement of a glycine with a tryptophan. 
Analogous minor variations may also include amino acid deletions or insertions, or both. 
Guidance in determining which amino acid residues may be substituted, inserted, or deleted 
may be found using computer programs well known in the art, for example, DNASTAR® 

30 software. 

% similarity or % identity: refer to the percentage of sequence similarity or identity found 
in a comparison of two or more amino acid or nucleic acid sequences. Percent similarity can 
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be determined by methods well-known in the art. For example, percent similarity between 
amino acid sequences can be calculated using the clustal method. See, e.g., Higgins & Sharp, 
73 Gene 237-44 (1988). The clustal algorithm groups sequences into clusters by examining 
the distances between all pairs. The clusters are aligned pairwise and then in groups. The 
5 percentage similarity between two amino acid sequences, e.g., sequence A and sequence B, is 
calculated by dividing the length of sequence A, minus the number of gap residues in 
sequence A, minus the number of gap residues in sequence B, into the sum of the residue 
matches between sequence A and sequence B, times one hundred. Gaps of low or of no 
homology between the two amino acid sequences are not included in determining percentage 
10 similarity. Percent similarity can be calculated by other methods known in the art, for 
example, by varying hybridization conditions, and can be calculated electronically using 
p programs such as the MEGALIGN™ program. (DNASTAR Inc., Madison. Wisconsin.) 

Operably linked: as used herein, refers to the state of any compound, including but not 
ol5 limited to deoxyribonucleic acid, when such compound is functionally linked to any 
JI^ promoter. In the context of the present invention, the nucleic acid sequence is one that 
^- encodes for a hyphal-specific protein. The promoter sequence initiates and mediates 

transcription of the nucleic acid sequence. 

020 Vector: a cloning vector that is designed so that a coding nucleic acid sequence inserted at a 
particular site will be transcribed and translated. A typical expression vector may contain a 
promoter, selection marker, nucleic acids encoding signal sequences, and regulatory 
sequences, e.g., polyadenylation sites, 5 -untranslated regions, and 3 -untranslated regions, 
termination sites, and enhancers. "Vectors" include viral derived vectors, bacterial derived 
25 vectors, plant derived vectors and insect derived vectors. 

Protein purification: broadly defined, any process by which proteins are separated from 
other elements or compounds on the basis of charge, molecular size, or binding affinity. 

30 Substantially purified: as used herein, includes nucleic or amino acid sequences that are 
removed from their natural environment, isolated or separated, and are at least 60% free, 
preferably at least 75% free, and most preferably at least 90% free from other components 
with which they are naturally associated. 
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Expression cassette: is conventional and refers to a combination of regulatory elements that 
are required by the host for the correct transcription and translation (expression) of the 
genetic information contained in the expression cassette. These regulatory elements comprise 
5 a suitable (i.e., functional in the selected host) transcription promoter and a suitable 
transcription termination sequence. 

Promoter: generally includes a regulatory region of DNA capable of initiating, directing and 
mediating the transcription of a nucleic acid sequence. Promoters may additionally comprise 
10 recognition sequences, such as upstream or downstream promoter elements, which may 
influence the transcription rate. Preferably, in the context of the present invention, the 
promoters are hyphal-specific gene promoters. 

Inhibition: as used herein, refers to a reduction in the parameter being measured, whether it 
15 be C. albicans growth or viabiUty. The amount of such reduction is measured relative to a 
standard (control). The prefered detection products may include newly transcribed mRNA 
and a DNA-DNABP complex. "Reduction" is defined herein as a decrease of at least around 
25% relative to control, preferably at least around 50%, and most preferably of at least aound 
75%. 

20 

DNA template: refers to double-stranded DNA and where indicated by the particular binding 
assay to single-stranded DNA that may be negatively supercoiled, possesses a promoter 
region. 

25 Formation of a complex: refers to the binding of a transcription factor(s) to a DNA template. 

Cis-acting element: refers to a variety of modular elements or target sequences. These 
elements may be targets for tissue-specific or temporal regulation. Generally, these elements 
only affect the activity of DNA sequences of its own DNA molecule. These elements may be 
30 found within enhancers, promoters, or other regulatory elements of a particular gene. 

DNA binding protein: refers to transcription factors and other regulatory proteins that 
recognize specific target sequences located in enhancers, promoters, or other regulatory 
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elements that affect a particular target gene. For a repressor protein, binding to DNA may 
itself be sufficient to excersise its function, e.g., inhibit transcription. Alternatively, to be a 
positive regulatory protein, the DNABP may excerise its function by binding to regulatory 
proteins or other transcription factors. 

5 

In accordance with the present invention, a patient preferably includes 
immunocompromised or immunosuppressed humans, for example, those having AIDS or 
undergoing transplantation or anti-cancer therapy. The invention also preferably relates to 
humans with primary or secondary immunodeficiencies (MERCK Manual 16th ed., Chapter 

10 19(1 992)). In addition to mammalian hosts in which the normal immune response has been 
compromised or suppressed, the invention relates to mammalian hosts in which the normal 
microbial flora has been disrupted, for example, because of disease (e.g., hereditary, 
metabolic, infiltrative, or hematologic), trauma (e.g., bum, splenectomy, anesthesia), surgical 
or clinical procedure (e.g., catheterization or introduction of artificial implants such as 

1 5 dentures), or chemical, radiation, or other immunosuppressive prophylaxis or treatment. 
Accordingly, the microbial infection of the present invention includes infections related to 
opportunistic as well as pathogenic microorganisms. 

A prefered embodiment of the present invention involves defining the molecular 
events leading to the expression of a pathogenically important adhesin. In a particular 

20 embodiment of the present invention, this will be accomplished by indentifying new genes 
that are co-regulated with HWPl in a putative global regulatory circuit. In a further 
embodiment, the present invention proposes novel means of interfering with the regulation of 
transcription of these new genes. Preferably, interfering with the regulation of transcription 
of hyphal-specific genes, specifically HWPl. Finally, another emodiment of the present 

25 invention may be used in the development of an alternative or adjunctive therapy for 
candidiasis infections. 

A hallmark of tissue invasion by the yeast C. albicans is the preponderance of hyphal 
forms during growth within tissue. This observation prompts the hypothesis that the ability 
to form hyphae is responsible for the pathogenesis of candidiasis. However, several lines of 

30 evidence suggest that it is not necessarily growth as hyphae that is essential for disease, but 
rather it is the expression of pro-invasive factors that coincide with hyphal growth that 
mediates tissue invasion. In some cases, the lack of filament production is correlated with a 
loss in virulence (Csank et al., 66(6) INFECT. IMMUN. 2713-21 (1998); Ghannoum et al.. 
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63(11) Infect. Immun. 4528-'30 (1995); Lo et al., 90(5) Cell 939-49 (1997); Zaragoza et al, 
180(15) J. Bacterjol, 3809-15 (1998)) but other examples exist where deletion of a specific 
gene product does not impair hyphal growth in tissues or in laboratory cultures, yet virulence 
is reduced in animal models. De Bemardis et al. 179(1) J. Infect. Dis. 201-8 (1999); Leidich 
5 et aL, 273(40) J. BlOL. Chem. 26078-86 (1998). Moreover, a mutant that grows obhgately in 
filamentous forms does not show enhanced virulence. Braun & Johnson, 277(5322) Science 
105-09 (1997); De Bemardis et ah; Leidich et al. Although some studies have suggested that 
mutant strains lacking the ability to form hj^hae are avirulent, it is not known whether the 
loss of virulence is a resuh of the lack or hyphae per se^ or a lack of specific pro-invasive 

10 factors that are expressed during hyphal growth. Finally, studies have shown that strains of 
C. albicans isolated from human patients that are uniformly succesful in forming hyphae, 
vary in their ability to invade tissue in a rat tongue model. Allen et al., 1 8(6) J. Oral 
Pathol. Med. 352-59 (1990). These observations all point to the presence of pro-invasive 
factors that if present in a strain, are produced during hyphal growth. Thus hyphal growth 

15 may be necessary but not sufficient for candidiasis to occur. In order to imderstand the 
pathogenesis of candidiasis, it is important to identify proteins that contribute to 
pathogenesis, and then to discover factors that regulate their expression. The ability to 
interfere with mechanisms of expression of pro-invasive genes will lead to the development 
of strategies to interfere with candidiasis. 

20 HWPl is a developmentally-regulated cell wall protein that is expressed exclusively 

in true hyphae in cultures and in animal tissues. Hwpl has a profound effect on adherence to 
human BECs through a novel mechanism that involves host enzymes, and is important for 
systemic candidiasis in mice. HWPl expression is tightly regulated at the level of mRNA. 
Experiments using cultures growing exclusively as yeasts lack HWPl mRNA, whereas in 

25 cultures of true hyphae, HWPl mRNA is abundant, e.g,. even more abundant than genes for 
glycolytic enzymes. The most plausible explanation for developmental expression is 
differences in transcription of HWPl in yeast and hyphal growth forms. The hypothesis is 
that flanking regions of the HWPl structural gene contain regulatory sequences that affect the 
levels of HWPl mRNA. A corollary is that HWPl is a target of a gene regulatory circuit that 

30 also controls expression of other hypha-specific genes that are pathogenically important. 

Hypha-specific genes, i.e., genes that are expressed in hyphal growth, but not in yeast 
growth, are important virulence factors. Bailey et al., 178(18) J. Bacteriol. 5353-60 (1996); 
Birse et al., 61(9) Infect. Immun. 3648-55 (1993); Staab et al. (1996). The hypha-specific 
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gene HWPl encodes an unconventional adhesin that is necessary for the pathogenesis of 
candidiasis. Staab et aL (1999). HWPl attaches tightly to host cells by virtue of being a 
substrate for mammalian transglutaminase, thereby forming isodipeptide bonds to proteins on 
surfaces of host cells. Besides being an adhesin, HWPl influences germ tube formation in 
5 the special case where blastospores are dropped on the surface of an agar plate. Sharkley et 
al. 181(17) J. Bacteriol. 5273-9 (1999). It is possible that the defective germ tube 
formation on surfaces of agar of the null mutant may be indicative of a defect in germ tube 
formation in growth in host tissues; but if so, the defect is subtle because germ tube formation 
in animal tissues, in liquid cultures, and in embedded agar media are normal for the hwp7 null 

1 0 mutant. It is more likely that the null mutant is aviralent because of defects in adherence to 
host cells. Other hypha-specific genes include an aspartyl proteinase (SAP6) and proteins 
predicted to be localized to the cell wall (ALS3 and HYR). Bailey et al.; Hoyer et al., 33(6) 
CURR. Genet. 451-59 (1998). Proteinase activity and conventional adherence through cell 
wall proteins other than HWPl are potentially important virulence attributes. Given the 

1 5 importance of HWPl in pathogenesis, and the potential roles for other hypha-specific 

proteins in pathogenesis based on their probable functions, the mechanisms regulating hypha- 
specific gene expression are of interest for defining strategies to inhibit candidiasis. 

The regulation of expression of hypha-specific genes is controlled by the presence of 
mRNA in hyphae but not in yeasts. The presence of multiple genes that are correlated with 

20 hyphal growth, and the common control determined by the presence of mRNA, suggests that 
common regulatory mechanisms may exist for inducing expression of hypha-specific genes. 
Initiation of transcription control is mediated by c/^-acting sequences, termed gene response 
elements, that interact with DNA binding proteins to repress or activate transcription in 
response to environmental signals. Upstream Activating Sequence (UAS) and Upstream 

25 Regulatory Sequence (URS) elements serve to activate or repress gene expression though 
interactions with DNA binding proteins. Hypha-specific genes are expected to have both 
UAS and URS elements that are responsible for turning on genes in hyphal growth and 
repressing them during yeast growth. The presence of common cz^-acting elements in 
different genes indicates that a mechanism for coordinate regulation of a set of genes that are 

30 appropriate for a given set of environmental conditions exists. Analysis and determination of 
the mechanisms regulating expression of these genes may also provide insight into the 
mechanism of the yeast/hypha transition in C. albicans. 
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Although tranS'SLcting proteins involved in gene expression of defined hypha-specific 
genes have not been reported, white-phase specific factors have been identified for the yeast- 
specific gene WHll, which was discovered upon analysis of the white/opaque switching 
system of C. albicans. The function of regions upstream of WHll and OP4 are the best 
5 analyzed and understood to date. A fiinctional analysis of the WHll promoter region has led 
to the identification of upstream c/^-acting regulatory sequences that form complexes with 
putative ^ran^-acting proteins in cell extracts (Srikantha et aL, 15(3) MOL. Cell. Biol, 1797- 
805 (1995)), whereas a MADS box protein consensus binding site has been shown to be 
important for expression ofOP4, Lockhart et aL, 180(24) J. Bacteriol. 6607-16 (1998), 

10 Further, the repression of WHll expression in C. albicans hyphae support the involvement of 
transcriptional control in the yeast/hypha transition. Importantly, the fiinctional determinants 
in the WHll promoter were not foimd have sequence similarity to c/^-acting regulatory 
regions fi*om other organisms indicating that C albicans probably has regulatory factors with 
features that are distinct from those of mammalian hosts and would therefore serve as suitable 

15 targets for development of antifungal strategies. 

The mechanisms that control expression of hypha-specific genes may also regulate 
the bud/hypha transition. Work on HWPl has shown that HWPl expression always coincides 
with hypha production, suggesting that common mechanisms may regulate morphogenesis 
and hypha-specific gene expression. Therefore knowledge about dimorphism is relevant. 

20 Pseudohyphal (elongated yeasts) growth of S, cerevisiae was used as a model for 

pseudohyphal and hyphal growth of C. albicans, STE12 of 5. cerevisiae^ the target of the 
pheromone response mitogen-activated protein kinase cascade and other upstream kinases, 
was found to be important for pseudohyphal growth of S. cerevisiae, Liu et aL, 262(5140) 
Science 1741-44 (1993). Homologs of members of this pathway were similarly found to be 

25 important in pseudohyphal and hyphal growth of C albicans. Kohler et aL, 93(23) Prog. 
Natl. Acad. Scl USA 13223-28 (1996); Leberer et aL, 93(23) Proc. Natl. Acad. Scl 
USA 13217-22 (1996); Liu et aL, 262(5140) Science 1741-44 (1993). A C albicans STE12 
homologue, CPHl, was also found to be necessary for production of pseudohyphae of C. 
albicans on nitrogen poor sohd media. Kohler & Fink, 266(5191) Science 1723-6 (1994), 

30 However, CPHl mutants still formed pseudohyphae and true hyphae when grown in hquid 
media and in response to serum. These results show that CPHl influences growth 
morphology in C. albicans^ but, probably plays a limited role, and other unidentified factors 
are also required. 
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A second signal transduction pathway that is involved in pseudohyphal growth of S. 
cerevisiae is the cAMP/PKA pathway. Increasing the level of cAMP artificially by mutation 
in a phosphodiesterase, or by activating mutations of Rasl protein GTPase or by adding 
exogenous cAMP enhances pseudohyphal formation. Gimeno et aL, 68(6) Cell 1077-90 
5 (1992); Lorenz & Heitman, 16(23) EMBO J. 7008-18 (1997); Ward et al., 15(12) MOL. Cell. 
Biol. 6854-63 (1995). Mutations that decrease the levels of cellular cAMP result in 
decreased filamentation presumably due to the reduction of cAMP. Kubler et al., 272(33) J. 
BlOL. Chem. 20321-3 (1997); Lorenz & Heitman, 16(23) EMBO J. 7008-18 (1997). The 
regulation of filamentation by cAMP levels also extends to other filamentuous fungi. 
10 Madhani & Fink, 8(9) TRENDS CELL. BlOL. 348-53 (1998). Lideed, a gene that is required for 
normal cAMP levels is also necessary for both dimorphism and HWPl production, 
p The MAPK and cAMP signaling pathways effect expression of the cell wall protein 

gents oiS. cerevisiae. Both pathways affect the promoter of 7, whose gene product is 
a cell wall protein required for pseudohyphal growth. Rupp et aL, 18(5) EMBO J. 1257-69 
ril5 (1999). The FLOU promoter is unusually large (2.8 kb), and has at least four UAS and nine 
pi URS elements. Similarly, the intergenic region between HWPl and APL2 is also large 
- (approximately 2.0 kb), suggesting that a conservation in the complexity required for 

regulation of filamentation genes may exist; HWPl probably also has multiple UAS and URS 
elements. 

o20 The C. albicans HWPl gene is likely under the control of the cAMP-dependent 

signalling pathway but not the MAPK pathway, in that HWPl expression was not altered in a 
cphl null mutant but was totally blocked in an efgl mutant. Sharkey et aL, 1 84(17) J. 
Bacteriol. 5273-9 (1999). Efglp is beheved to be a transcriptional target of the cAMP 
dependent signaling pathway in C. albicans because of a putative proteinase K modification 

25 site. Efglp is also necessary for formation of germ tubes and for virulence in an animal 

model. HWPl expression is also reduced in mutants lacking the RBFl gene which encodes a 
different transcription factor that has not been associated with a particular signal transduction 
pathway. Ishii et al., 143(Pt 2) MICROBIOL. 429-35 (1997). Indeed, expression of hypha- 
specific genes and production of hyphae themselves may thus share common regulatory 

30 features. 

The environmental cues that activate bud/hypha signalling cascades are xmknown, but 
historical data strongly suggest that nitrogen regulatory circuits are involved. In many fungi, 
nitrogen utilization is important not only for the synthesis of essential metobolites, but also 
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for interconversion between growth morphologies that are important for environmental 
adaptation. Spore germination in Aspergillus and Rhizopus is preferentially induced in poor 
nitrogen sources such as prohne. Weber & Ogawa, 55 Phytopath. 262-66 (1965); Yanagita, 
26 Arch. Microbiol. 329-44 (1957). In diploid strains of 5'. cerevisiae, nitrogen starvation 
5 or use of proline as a nitrogen source leads to production of pseudohyphal cells which invade 
solid agar, and are thought to be necessary for growth in the natural environment. Gimeno et 
al., 68(6) Cell 1077-90 (1992). In C. albicans, the use of proline or selected other amino 
acids induces production of true hyphae, as well as pseudohyphae from yeast forms. 
Dabrowa et al., 13(3) Infect. Immun. 830-35 (1976); Holmes & Shepherd, 133(Pt II) J. Gen. 
10 Microbiol, 3219-28 (1987); Land et al., 11(5) Infect. Immun. 1014-23 (1975). 

Morphologic variation in response to proline is also found in plant pathogenic ftingi and is 

O thought to be important for pathogenesis. Kulkami & Nickerson, 5 Exp. Mycol. 145-54 
(1981). Thus morphologic variation that accompanies the use of nitrogen sources is a 

ry common feature of fungal growth. 

q15 The control of nitrogen supply by prokaryotic and eukaryotic organisms is highly 

:^ regulated. Fungi are able to utilize an array of compounds as nitrogen sources, and have 

evolved the capability to express different catabolic enzymes to make nitrogen available to 
the cell. Nitrogen metablism regulation has been well studied in S, cerevisiae^ Aspergillus 
^^f nidulans, and Neurospora crassa. When preferred sources of nitrogen such as ammonia, 
D20 glutamine, and glutamate are not available or are in concentrations too low to support growth, 
the synthesis of pathway-specific catabolic enzymes and permeases are derepressed (53). 
Activating global regulatory genes in Aspergillus (areA; (Caddick, Molecular Biology of 
Filimentous Fungi 141-52 (1992); Kudla et al., 9(5) EMBO J. 1355-64 (1990)), Neurospora 
(nit~2; (Fu & Marzluf, 7(5) MOL. Cell. Biol. 1691-96 (1987); Stewart & Vollmer, 46(2-3) 
25 Gene 291-95 (1986)), Saccharomyces (gln-S; Minehart & Magasanik, 1 1(12) MOL. Cell. 
Biol, 6216-28 (1991)), and Penicillum (nre; Haas et al., 27(2) CuRR. Genet. 150-58 (1995)) 
have been found that code for GATA-type zinc finger transcription factors which activate 
specific catabolic genes when preferred nitrogen sources are lacking. These regulatory 
proteins all have a conserved DNA binding domain which consists of a single Cys2/Cys2-type 
30 zinc finger motif with a central loop of 17 amino acid residues. Marzluf, 61(1) MOL. BlOL. 
Rev. 17-32 (1997). The amino acid conservation in the DNA binding region is high among 
the different members of this GATA family. Mutations of conserved residues in the DNA 
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binding domains of NIT2 or AREA lead to complete lack of DNA binding in vitro and are 
nonfunctional in vivo, Fu & Marzluf, 4(1 1) MOL. MiCROBiOL. 1847-52 (1990). 

URS elements may indeed mediate repression of filamentous growth in C. albicans 
strains. Strains that lack the S. cerevisiae repressor protein homologue, TUPl of C albicans 
5 grow exclusively in filamentous form. Braun & Johnson, 277(5322) Science 105-09 ( 1997). 
Filamentous forms of tup 1 mutant cells range from short pseudohyphae to long thread-like 
structures resembling true hyphae depending on the environmental and media conditions. 
The TUPl gene of C. albicans was studied based on its role in S. cerevisiae where it serves as 
a pleiotropic repressor for many genes through protein-protein interactions with DNA 

10 binding proteins. Indeed, there may exist in yeast forms of C albicans of DNA-binding 
repressor proteins that strongly regulate the bud/hypha transition. 

TUPl may partially repress expression of hypha-specific genes. In a differential 
screen for genes that are expressed in tupl mutant cells, but not in TUPl parental cells, the 
mRNA for the hypha-specific gene, HWPl, was found. Sharkey et al. reported that HWPl 

15 was deprepressed in a tupl mutant. Sharkley et al. However, when tupl mutants were 
analyzed, surface HWPl protein was not detected on hyphal surfaces under most culture 
conditions, suggesting that release from TUPl repression permits only partial HWPl 
expression and that additional proteins are required for normal levels and surface localization 
of hypha-specific gene products. Indeed, the upstream regions of hypha-specific genes of C 

20 albicans may contain sites for proteins that interact with the TUPl gene product, or perhaps 
with Tupl itself; however, interaction with Tupl is less likely because S, cerevisiae Tup 1 
protein is not a DNA binding protein, but functions through interactions with other DNA 
binding proteins. The partial derepression of HWPl in tupl mutants incidates that proteins 
other than TUPl repress HWPl in yeasts. 

25 A powerfiil method for identifying UAS and URS is to fiise DNA upstream of a gene 

coding regions to a reporter gene whose protein level can be easily measured. URS elements 
are identified by increased reporter gene expression relative to the intact DNA upon deletion 
of the element. Conversely, UAS elements required for expression are identified by a loss of 
reporter gene activity upon deletion of the element. A second approach for identifying UAS 

30 elements is to design individual, overlapping bp sequence elements spanning the entire DNA 
sequence, and then look for activation of reporter gene expression relative to a simple 
promoter containing a TATA element. Guarente & Ptashne, 78(4) Proc. Natl. Acad. Sci. 
USA 2199-203 (1981); Rupp et al. 18(5) EMBO J. 1257-69 (1999). 



-"26- 



The FLO 11 gene upstream region, which is similar to HWPl in having an unusually large 
upstream region (Rupp et al. 18(5) EMBO J. 1257-69 (1999)), was studied using the lacZ 
reporter gene whereas the luciferase reporter gene was used to study regulatory regions in C 
albicans genes that were under the control of switching. Guarente, The MOLECULAR AND 
5 Cellular Biology of the Yeast Saccharomyces cerevisiae (E. Jones, J. Pringle, and J. 
Broach ed.) (1992), 

Green Fluorescent Protein (GFP) has proven to be a valuable marker of gene expression 
in studies employing optical microscopy in research in microbial pathogenesis. The 
advantages of GFP have been made available for studies in C. albicans by B. Cormack et al., 

10 who optimized all the GFP codons for expression in C. albicans, Cormack et al., 143(Pt 2) 
Microbiol. 303-11 (1997). GFP within C albicans is readily visible by fluorescence 
microscopy and is easily quantified by flow cytometry. GFP in solution is also quantifiable 
by fluorometry. Kahn et al., 7(4) Curr. Biol. R207-08 (1997). These advances indicate that 
GFP will be an excellent reporter for studies on inducible expression of HWPl in C 

15 albicans. 

The transcriptional regulation of gene expression through c/^-acting sequences is 
controlled by DNA binding proteins. Schleif, 241(4870) SCIENCE 1 182-87 (1988). Evidence 
for the presence of unknown DNA binding proteins is provided by electrophoretic mobility 
shift assays, where proteins in crude nuclear extracts of cells growing in appropriate 

20 conditions are assayed for their abihty to retard the migration of labeled UAS or URS 

elements in polyacrylamide gels. Mutations or deletions in the UAS are employed to verify 
the specificity of the interactions of DNA binding proteins with the UAS element and to map 
the most important nucleotides for DNA binding. The presence of unknown binding proteins 
to UAS elements provides the groundwork for cloning genes encoding regulatory proteins. 

25 Several strategies have been successfully used to clone genes encoding proteins that bind 

to UAS elements. A particularly successful method is to use DNA affinity beads that consist 
of UAS elements that have incorporated biotinylated nucleotides to facilitate coupling to 
streptavidin beads. Gabrielsen & Huet, 218 METHODS Enzymol. 508-25 (1993). Nuclear 
extracts are then incubated with the beads to separate the desired DNA binding proteins from 

30 other proteins in the extract, followed by elufion of the DNA binding protein from the beads, 
purification on SDS polyacrylamide gels and amino acid sequencing of HPLC-purified 
proteolytic fragments. The amino acid sequences are then used to design degenerate primers 
for amplification of a PGR product from cDNA to be used as a probe in library screening to 
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obtain the DNA binding protein gene. Verification of the biological activity of the DNA 
binding protein is gained from creating null mutants in the gene and looking for loss of 
expression of the reporter gene under the control of the specific UAS elements used to 
identify and characterize the protein. The pattern of expression of the DNA binding protein 
5 gene itself, along with the predicted primary amino acid sequence, provide insight into 

regulatory mechanisms. Predictions about whether expression is mediated by transcription, 
or by protein modification, are possible. In the event that the DNA binding protein gene is 
regulated by transcription, its non-flanking regions provide material for identification of cz5- 
acting regulatory regions as proposed for the primary target gene, HWPl . 

1 0 One of the benefits of cloning gene(s) encoding proteins that bind to the promoter of a 

target gene like HWPJ is that global gene regulatory networks can be readily sought. Target 
genes that are beUeved to be activated by the same mechanism will have reduced levels of 
mRNA in a null mutant lacking the DNA binding protein gene. For example, expression of 
sporulation specific genes ofS. cerevisiae is regulated by the IMEl gene. Deletion of IMEl 

15 results in the loss of expression of an entire family of sporulation-specific genes. Mitchell, 
58(1) Microbiol. Rev. 56-70 (1994). Within the context of the present invention, all of the 
known hypha-specific genes will be assessed for expression in a null mutant of the DNA 
binding protein gene compared to cells having the gene, thereby revealing the importance of 
the DNA binding protein in regulating a set of genes. 

20 Another aspect of the present invention is the identification of new genes that are 

regulated by hypha-specific controls, because such genes are likely to be important in 
virulence. The null mutant of the putative DNA binding protein genes serves as a tool to 
achieve this goal as well, for identification of genes with low message levels compared to 
mRNA fi-om cells with the wild-type DNA binding protein gene. The ideal methodology for 

25 the identification of new genes is genome wide expression monitoring. DeRisi & Iyer, 1 1(1) 
CURR. Opin. Oncol. 76-79 (1999). The proven utility of genome wide expression 
monitoring in revealing previously unidentified genes that are up or down-regulated under 
any given environmental conditions has been demonstrated multiple times. For example, 
new sporulation-specific genes, and regulatory circuits were identified using genome-wide 

30 expression monitoring of the S, cerevisiae genome. Cox et al., 15(8) YEAST 703-13 (1999). 
Mammalian genes in fibroblasts that are regulated in the presence of serum have also been 
identified using genome wide expression monitoring with the available but incomplete 
mammahan genome sequences. Iyer et aL, 283(5398) SCIENCE 83-87 (1999), In both cases. 
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unexpected genes were identified underscoring the value of the technique. Similar studies are 
plausible using the C. albicans genome. The sequence of the C. albicans genome has been 
completed to five times coverage. 

Given the high significance of the identification of new genes with potential 
5 importance in virulence in an organism where random mutagenesis approaches are not 

possible, exploration of alternatives other than genome wide expression monitoring may be 
warranted. Consequently, a "work-around" method that does not require the production of 
gene-specific probes may be utilized. Primers that hybridize to vector sequences are 
employed to amplify small genomic inserts that are robotically spotted on membranes to 
10 generate mini-arrays. This array method is being used for genome wide expression 
monitoring in H, capsulatum. 

Opportunities for proliferation and invasion of mammalian hosts by C. albicans are 
continuing to increase. Because of the loss of the cellular component of the immune system, 
AIDS patients are susceptible to invasion of submucosal tissue by C albicans. The 
1 5 fi-equency of candidal infections may also be a result of the prophylactic use of antibacterial 
drugs used in AIDS patients to minimize other opportunistic infections. Candidal infections 
increase in severity and recur more firequently as the immunodeficiency progresses. In non- 
AIDS patients, such as those undergoing organ transplantation, are neutropenic, or have 
debilitating diseases requiring advanced modalities of life support, mucosal and 
20 hematogenously disseminated candidiasis seriously threaten optimal treatment outcomes. 
While antifungal drugs can be effective, the increasing frequency of resistant strains of C 
albicans, and the systemic side effects of the drugs prompts exploration of novel strategies to 
interrupt the sequence of events leading to disease and to expand the repertoire of antifiingal 
drugs. Both goals require improved understanding of hypha-specific proteins of C albicans 
25 in terms of their roles in pathogenicity and in the mechanisms that regulate their expression. 
By defining the molecular events leading to the expression of a pathogenically important 
adhesin, and through the identification of new genes that are co-regulated with HWPl in a 
putative global regulatory circuit, the present invention relates to new and novel ways to 
interfere with candidiasis. The long term medical benefits of this study may be the 
30 development of alternative or adjunctive therapies based on the knowledge expression of 
hypha-specific genes in C albicans. 

Recombinant techniques 
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The DNA molecules of the present invention may be employed by recombinant 
techniques. Thus, for example, the DNA molecule sequence may be included in any one of a 
variety of expression vehicles, in particular vectors or plasmids for expressing such a 
polypeptide. Such vectors include chromosomal, nonchromosomal and synthetic DNA 
5 sequences, e.g., derivatives of SV40; bacterial plasmids; phage DNA; yeast plasmids; vectors 
derived from combinations of plasmids and phage DNA; viral DNA such as vaccinia, 
adenovirus, fowl pox virus, and pseudorabies. However, any other vector or plasmid may be 
used as long as they are repHcable and viable in the host. 

The appropriate DNA sequence may be inserted into the vector by a variety of 

10 procedures. In general, the DNA sequence is inserted into an appropriate restriction 

endonuclease sites by procedures known in the art. Such procedures and others are deemed 
to be within the scope of those skilled in the art. 

The DNA sequence in the expression vector may be operatively linked to an 
appropriate expression control sequence(s) (promoter) to direct mRNA synthesis. As 

1 5 representative examples of such promoters, there may be mentioned: LTR or S V40 promoter, 
the E. coli. lac or trp, the phage lambda Pl promoter and other promoters known to control 
expression of genes in prokaryotic or eukaryotic cells or their viruses. The expression vector 
also contains a ribosome binding site for translation initiation and a transcription terminator. 
The vector may also include appropriate sequences for amplifying expression. 

20 In addition, the expression vectors preferably may contain a gene to provide a 

phenotypic trait for selection of transformed host cells such as dihydrofolate reductase or 
neomycin resistance for eukaryotic cell culture, or such as tetracycline or ampiciUin 
resistance in E, coli. 

An embodiment of the invention comprises an isolated DNA molecule comprising the 
25 nucleotide sequence of the HWP gene promoter. This nucleotide sequence, or fragments or 
functional equivalents thereof, may be used to generate recombinant DNA molecules that 
direct the expression of the polypeptides of the present invention, or functionally active 
peptides or functional equivalents thereof, in appropriate host cells. Due to the degeneracy of 
the nucleotide coding sequence, other DNA sequences which encode substantially the same 
30 amino acid sequences as depicted, or analogs or fragments thereof, may be used in the 

practice of the invention for the cloning and expression of such a promoter. Such alterations 
include deletions, additions or substitutions of different nucleotide residues resulting in a 
sequence that encodes the same or a functionally equivalent gene product. The gene product 
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may contain deletions, additions or substitutions of amino acid residues within the sequence, 
which result in a silent change thus producing a bioactive product. Such amino acid 
substitutions may be made on the basis of similarity in polarity, charge, solubility, 
hydrophobicity, hydrophihcity, the amphipathic nature of the residues involved and/or on the 

5 basis of crystallographic data. For example, negatively charged amino acids include aspartic 
acid and glutamic acid; positively charged amino acids include lysine and arginine; amino 
acids with uncharged polar head groups having similar hydrophihcity values include the 
following: leucine, isoleucine, vahne; glycine, alanine; asparagine, glutamine; serine, 
threonine; phenylalanine, tyrosine. 

10 Techniques well known to those skilled in the art for the isolation of DNA, generation 

of appropriate restriction fragments, construction of clones and libraries, and screening 
recombinants may be used. For a review of such techniques, see, for example, Maniatis et 
AL., Molecular Cloning: A Laboratory Manual, Chapters 1-18 (2''^ Ed., Cold Spring 
Harbor Laboratory 1989). Also, the 5' untranslated and coding regions of the nucleotide 

1 5 sequence could be altered to improve the translational efficiency of the mRNA. hi addition, 
based on X-ray crystallographic data, sequence alterations could be undertaken to improve 
protein stability, e.g., introducing disulfide bridges at the appropriate positions, and/or 
deleting or replacing amino acids that are predicted to cause protein instability. These are 
only examples of modifications that can be engineered to produce a more active or stable 

20 protein, more protein, or even change the substrate specificity of the protein. 

The vector containing the appropriate DNA sequence, as well as an appropriate 
promoter or control sequence, may be employed to transform an appropriate host to permit 
the host to express the polypeptides of the present invention. Representative examples of 
appropriate hosts include: bacterial cells, such as E, coli. Salmonella typhimurium, 

25 Streptomyces; fungal cells, such as yeast; insect cells, such as Drosophila S2 and Spodoptera 
Sf9; animal cells, such as CHO, COS or Bowes melanoma; adenoviruses; plant cells, etc.. 
The selection of an appropriate host and transformation technique within the scope of those 
skilled in the art. 

More particularly, the present invention also includes recombinant constructs 
30 comprising one or more of the sequences of the present invention. The constructs may 

comprise a vector, such as a plasmid or viral vector, into which a sequence of the invention 
has been inserted, in a forward or reverse orientation. In a preferred aspect of this 
embodiment, the construct may further comprise regulatory sequences, including, for 
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example, a promoter operably linked to the sequence. Large numbers of suitable vectors and 
promoters are known to those of skill in the art, and are commercially available. The 
following vectors are provided by way of example. Bacterial: pQE70, pQE60, pQE-9 
(Qiagen), pBS, phagescript, psiX174, pBluescript SK, pBsKS, pNHSa, pNH16a, pNHlSa, 
5 pNH46a (Stratagene); pTRC99A, pKK223-3, pKK233-3, pDR540, PRIT5 (Pharmacia). 
Eukaryotic: pWLneo, pSV2cat, pOG44, pXTl, pSG (Stratagene); pSVK3, pBPV, pMSG, 
PSVL (Pharmacia). However, any other plasmid or vector may be used as long as they are 
replicable and viable in the host. 

In a further embodiment, the present invention relates to host cells containing the 

10 above-described construct. The host cell can be a higher eukaryotic cell, such as a 

mammalian cell, or a lower eukaryotic cell, such as a yeast cell, or the host cell can be a 
prokaryotic cell, such as a bacterial cell. The host cell preferably may secrete the 
recombinant protein. Introduction of the construct into the host cell can be effected by 
calcium phosphate transfection, DEAE-Dextran mediated transfection, or electroporation (L. 

1 5 Davis et al. , Basic Methods in Molecular Biology, 1 986)). 

Promoters 

Suitable promoter regions can be selected from any desired gene using CAT 
(chloramphenicol transferase) vectors or other vectors with selectable markers. Two 

20 appropriate vectors are pKX232-8 and pCM7. Particular named bacterial promoters include 
laci, lacZ, T3, T7, gpt, lambda Pr, PLand trp. Eukaryotic promoters include CMV immediate 
early, HSV thymidine kinase, early and late SV40, LTRs from retrovirus, and mouse 
metallothionein-I. Selection of the appropriate vector and promoter is well within the level of 
skill in the art. The constructs in host cells can be used in a conventional manner to produce 

25 the gene product encoded by the recombinant sequence. 

Enhancers 

Transcription of a DNA encoding the polypeptides of the present invention by higher 
eukaryotes may be increased by inserting an enhancer sequence into the vector. Enhancers 
30 are cis-acting elements of DNA, usually from about 10 to 300 bp, that act on a promoter to 
increase its transcription. Examples include the SV40 enhancer on the late side of the 
replication origin (base pair 100 to 270), a cytomegalovirus early promoter enhancer, a 
polyoma enhancer on the late side of the replication origin, and adenovirus enhancers. 
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Selectable markers 

Generally, recombinant expression vectors will include origins of replication and 
selectable markers permitting transformation of the host cell, e.g,, the ampicillin resistance 
5 gene of E, coli and S. cerevisiae TRPl gene, and a promoter derived from a highly-expressed 
gene to direct transcription of a downstream structural sequence. Such promoters can be 
derived from operons encoding glycolytic enzymes such as 3-phosphoglycerate kinase 
(PGK), alpha factor, acid phosphatase, or heat shock proteins, among others. The 
heterologous structural sequence is preferably assembled in appropriate phase with 
10 translation, initiation and termination sequences, and preferably, a leader sequence capable of 
directing secretion of translated protein into the periplasmic space or extracellular medium. 
Optionally, the heterologous sequence can encode a fusion protein including an N-terminal 
identification peptide imparting desired characteristics, e.g., stabilization or simplified 
purification of expressed recombinant product. 

15 

Expression vectors 

Useftil expression vectors for bacterial use may be constructed by inserting a 
structural DNA sequence encoding a desired protein together with suitable translation, 
initiation and termination signals in operable reading phase with a functional promoter. The 

20 vector may comprise one or more phenotypic selectable markers and an origin of repHcation 
to ensure maintenance of the vector and to, if desirable, provide amplification within the host. 
Suitable prokaryotic hosts for transformation include E. coli. Bacillus subtilis. Salmonella 
typhimurium and various species within the genera Pseudomonas, Streptomyces^ and 
Staphylococcus, although others may also be employed as a matter of choice. 

25 As a representative but nonlimiting example, usefiil expression vectors for bacterial 

use can comprise a selectable marker and bacterial origin of replication derived from 
commercially available plasmids comprising genetic elements of the well known cloning 
vector pBR322 (ATCC 37017). Such commercial vectors include, for example, pKK223-3 
(Pharmacia Fine Chemicals, Uppsala, Sweden) and GEMl (Promega Biotec, Madison, WI). 

30 These pBR322 backbone sections are combined with an appropriate promoter and the 
structural sequence to be expressed. 

Following transformafion of a suitable host strain and growth of the host strain to an 
appropriate cell density, the selected promoter may be de-repressed by appropriate means 
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(e.g., temperature shift or chemical induction) and cells maybe cultured for an additional 
period. Cells are typically harvested by centrifugation, disrupted by physical or chemical 
means, and the resulting crude extract retained for further purification. 

Microbial cells employed in expression of proteins can be disrupted by any 
5 convenient method, including freeze-thaw cycling, sonication, mechanical disruption, or use 
of cell lysing agents. Various mammalian cell culture systems can also be employed to 
express recombinant polypeptides. Examples of mammalian expression systems include the 
COS-7 lines of monkey kidney fibroblasts, described by Gluzman and other cell lines capable 
of expressing a compatible vector, for example, the C127, 3T3, CHO, HeLa and BHK cell 

10 lines. Gluzman, 23(1) Cell 175-82 (1981). Mammalian expression vectors may comprise an 
origin of repUcation, a suitable promoter and enhancer, and also any necessary ribosome 
binding sites, polyadenylation sites, splice donor and acceptor sites, transcriptional 
termination sequences, and 5' flanking nontranscribed sequences. DNA sequences derived 
from the SV40 viral genome, for example, SV40 origin, early promoter, enhancer, splice, and 

1 5 polyadenylation sites may be used to provide the required nontranscribed genetic elements. 

Gene Therapy 

Another embodiment of the present invention involves the use of gene therapy 
applications, which will interfere with the transcription mediated by cis acting sequences of 

20 hyphal-specific genes. Gene therapy has been broadly defined as "the correction of a disease 
phenotype through the introduction of new genetic information into the affected organism." 
Roemer & Friedmann, 208(2) EUR. J. BIOCHEM. 21 1-25 (1992), Two basic approaches to 
gene therapy have evolved: {!) ex vivo gene therapy and (2) in vivo gene therapy. In ex vivo 
gene therapy, cells are removed from a subject and cultured in vitro, A functional 

25 replacement gene is introduced into the cells (transfection) in vitro, the modified cells are 
expanded in culture, and then reimplanted in the subject. These genetically modified, 
reimplanted cells are reported to secrete detectable levels of the transfected gene product in 
situ. Miller, 76(2) Blood 271-78 (1990); Selden et aL, 317(17) New Eng. J, Med. 1067-76 
(1987). The development of improved retroviral gene transfer methods (transduction) 

30 facilitates the transfer into and subsequent expression of genetic material by somatic cells. 

Cepko et al., 37(3) Cell 1053-62 (1984). Accordingly, retrovirus-mediated gene transfer has 
been used in clinical trials to mark autologous cells and as a way of treating genetic disease. 
Rosenberg et aL, 323(9) New Eng. J. Med. 570-78 (1990); Anderson, 2 Hum. Gene Ther. 
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99-1 00 (1991), Several ex vivo gene therapy studies in humans have been reported. 
Anderson, 256(5058) Science 808-13 (1992); Miller, 357(6378) Nature 455-60 (1992). 

In in vivo gene therapy, target cells are not removed from the subject. Rather, the 
transferred gene is introduced into cells of the recipient organism in situ, that is, within the 
5 recipient. In vivo gene therapy has been examined in several animal models. Feigner & 
Rhodes, 349(6307) NATURE 351-52 (1991). Publications have reported the feasibility of 
direct gene transfer in situ into organs and tissues such as muscle (Ferry et al., 88(19) Proc. 
Natl. Acad. Sci. USA 8377-81 (1991); Quantin et al., 89(7) Proc, Natl. Acad. Scl USA 
2581-84 (1992)), hematopoietic stem cells (Clapp et al., 78(4) BLOOD 1 132-39 (1991)), the 

10 arterial wall (Nabel et al., 244(4910) Science 1342-44 (1989)), the nervous system (Price et 
al., 84(1) Proc, Natl. Acad. Scl USA 156-60 (1987)), and lung (Rosenfeld et al., 
252(5004) Science 431-34 (1991)). Direct injection of DNA into skeletal muscle (Wolff et 
ah, 247(4949 Pt 1) SCIENCE 1465-68 (1990)), heart muscle (Kitsis et al., 88(10) PROC. Natl. 
Acad. Scl USA 4138-42 (1991)) and injection of DNA-lipid complexes into the vasculature 

15 (Lim et al, 83(6) CIRCULATION 2007-1 1 (1991); Ledere et al„ 90(3) J, Clin. Invest 936-44 
(1992); Chapman et ah, 71(1) CiRC. Res. 27-33 (1992)) also have been reported to yield a 
detectable expression level of the inserted gene product(s) in vivo. 

Gene therapy efforts have been aimed at the identification of various cell types for 
transformation, including keratinocytes (Morgan et al., 237(4821) SCIENCE 1476-79 (1987)), 

20 fibroblasts (Palmer et al., 88(4) Proc. Natl. Acad. Scl USA 1330-34 (1991); Garver, Jr. et 
al., 237(4816) Science 762-64 (1987); hitemational Patent Application PCT/US92/01890, 
having publication number WO 92/15676), lymphocytes (Reimann et al., 89(1) J. IMMUNOL. 
Methods 93-101 (1986)), myoblasts (Barr & Leiden, 254(5037) Science 1507-09 (1991); 
Dai et aL, 89(22) Proc. Natl. Acad. Scl USA 10892^5 (1992); Roman et al., 18(3) Somat. 

25 Cell Mol. Genet. 247-58 (1992)), smooth muscle cells (Lynch et al, 89(3) Proc. Natl. 
Acad. Scl USA 1 138-42 (1992)), and epithelial cells (Nabel et ah, 244(4910) Science 
1342-4 (1989), Intemational Patent Application PCT/US89/05575, having publication 
number WO 90/06997), the contents of which references and patent/patent applications are 
incorporated herein by reference. 

30 The delivery of an effective dose of a prophylactic or therapeutic agent in situ 

depends on the efficiency of transfection (or transduction) as well as the number of target 
cells. Epithelial cell-based gene therapy, in particular, involves a relatively small area 
available in situ for receiving genetically modified epithelial cells. The delivery of an 
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effective dose of prophylactic or therapeutic agent in situ thus depends upon the total number 
of implanted epithelial cells. 

hi one embodiment of the invention, exogenous genetic material (e.g., a cDNA 
encoding a polypeptide of the present invention) is introduced into a syngeneic host cell ex 
5 vivo or in vivo by genetic transfer methods^ such as transfection or transduction, to provide a 
genetically modified host cell Various expression vectors (i.e., vehicles for facihtating 
delivery of exogenous genetic material into a target cell) are known to one skilled in the art. 

Transfection refers to the insertion of nucleic acid into a mammalian host cell using 
physical or chemical methods. Several transfection techniques are known to those of 

10 ordinary skill in the art including: calcium phosphate DNA co-precipitation (Methods in 
Molecular Biology, Vol. 7, Gene Transfer and Expression Protocols, Ed. E. J. Murray, 
Humana Press (1991)); DEAE-dextran; electroporation; cationic liposome-mediated 
transfection; and tungsten particle-facilitated microparticle bombardment. Johnston, 
346(6286) Nature 776-77 (1990). Strontium phosphate DNA co-precipitation is a preferred 

1 5 transfection method. Brash et al, 7(5) Molec. Cell. Biol. 203 1 -34 (1 987). 

In contrast, transduction refers to the process of transferring nucleic acid into a cell 
using a DNA or RNA virus. A RNA virus (i.e., a retrovirus) for transferring a nucleic acid 
into a cell is referred to herein as a transducing chimeric retrovirus. Exogenous genetic 
material contained within the retrovirus is incorporated into the genome of the transduced 

20 host cell. A host cell that has been transduced with a chimeric DNA virus (e.g., an 

adenovirus carrying a cDNA encoding a therapeutic agent) will not have the exogenous 
genetic material incorporated into its genome, but will be capable of expressing the 
exogenous genetic material that is retained extrachromosomally within the cell. 

Typically, the exogenous genetic material includes the heterologous gene (usually in 

25 the form of a cDNA comprising the exons coding for the therapeutic protein) together with a 
promoter to control transcription of the new gene. The promoter characteristically has a 
specific nucleotide sequence necessary to initiate transcription. Optionally, the exogenous 
genetic material further includes additional sequences (i.e., enhancers) required to obtain the 
desired gene transcription activity. For the purpose of this discussion an enhancer is simply 

30 any non-translated DNA sequence which works contiguous with the coding sequence (in cis) 
to change the basal transcription level dictated by the promoter. Preferably, the exogenous 
genetic material is introduced into the host cell genome immediately downstream from the 
promoter so that the promoter and coding sequence are operatively linked so as to permit 
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transcription of the coding sequence. A preferred retroviral expression vector includes an 
exogenous promoter element to control transcription of the inserted exogenous gene. Such 
exogenous promoters include both constitutive and inducible promoters. 

Naturally-occurring constitutive promoters control the expression of essential cell 
5 functions. As a result, a gene under the control of a constitutive promoter is expressed under 
all conditions of cell growth. Exemplary constitutive promoters include the promoters for the 
following genes which encode certain constitutive or housekeeping functions: hypoxanthine 
phosphoribosyl transferase (HPRT), dihydrofolate reductase (DHFR) (Scharfmann et aL, 
88(1 1) Proc. Natl. Acad. Scl USA 4626-30 (1991)), adenosine deaminase, 
1 0 phosphoglycerol kinase (PGK), pyruvate kinase, phosphoglycerol mutase, the beta -actin 
promoter (Ch'ng et al., 86(24) Proc. Natl. Acad. Scl USA 10006-10 (1989)), and other 
constitutive promoters known to those of skill in the art. In addition, many viral promoters 
function constitutively in eukaryotic cells. These include: the early and late promoters of 
SV40, the long terminal repeats (LTRs) of Moloney Leukemia Virus and other retroviruses, 
1 5 and the thymidine kinase promoter of Herpes Simplex Virus, among many others. 

Accordingly, any such constitutive promoters can be used to control transcription of a 
heterologous gene insert. 

Genes that are under the control of inducible promoters are expressed only or to a 
greater degree, in the presence of an inducing agent, (e.g., transcription imder control of the 
20 metallothionein promoter is greatly increased in presence of certain metal ions). Inducible 
promoters include responsive elements (REs) which stimulate transcription when their 
inducing factors are bound. For example, there are REs for serum factors, steroid hormones, 
retinoic acid and cyclic AMP. Promoters containing a particular RE can be chosen in order 
to obtain an inducible response, and in some cases, the RE itself may be attached to a 
25 different promoter, thereby conferring inducibiUty to the recombinant gene. Thus, by 

selecting the appropriate promoter (constitutive versus inducible; strong versus weak), it is 
possible to control both the existence and level of expression of a therapeutic agent in the 
genetically modified host cell. If the gene encoding the prophylactic or therapeutic agent is 
under the control of an inducible promoter, delivery of the agent in situ is triggered by 
30 exposing the genetically modified cell in situ to conditions for permitting transcription of the 
prophylactic or therapeutic agent, e.g., by intraperitoneal injection of specific inducers of the 
inducible promoters which control transcription of the agent. For example, in situ expression 
by genetically modified host cells of a therapeutic agent encoded by a gene under the control 
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of the metallothionein promoter, is enhanced by contacting the genetically modified cells 
with a solution containing the appropriate (i.e., inducing) metal ions in situ. 

Accordingly, the amount of therapeutic agent that is delivered in situ is regulated by 
controlling such factors as: (1) the nature of the promoter used to direct transcription of the 
5 inserted gene (i.e., whether the promoter is constitutive or inducible, strong or weak); (2) the 
number of copies of the exogenous gene that are inserted into the host cell; (3) the number of 
transduced/transfected host cells that are administered (e.g., implanted) to the patient; (4) the 
size of the implant (e.g., graft or encapsulated expression system); (5) the number of 
implants; (6) the length of time the transduced/transfected cells or implants are left in place; 
10 and (7) the production rate of the prophylactic or therapeutic agent by the genetically 

modified host cell. Selection and optimization of these factors for delivery of an effective 
dose of a particular prophylactic or therapeutic agent is deemed to be within the scope of one 
of skill in the art, taking into account the above-disclosed factors and the clinical profile of 
the patient. 

15 In addition to at least one promoter and at least one heterologous nucleic acid 

encoding the prophylactic or therapeutic agent, the expression vector preferably includes a 
selection gene, for example, a neomycin resistance gene, for facilitating selection of host cells 
that have been transfected or transduced with the expression vector. Alternatively, the host 
cells are transfected with two or more expression vectors, at least one vector containing the 

20 gene(s) encoding the prophylactic or therapeutic agent(s), the other vector containing a 

selection gene. The selection of a suitable promoter, enhancer^ selection gene and/or signal 
sequence is deemed to be within the scope of one skilled in the art. 

The prophylactic or therapeutic agent can be targeted for delivery to an extracellular, 
intracellular or membrane location. If it is desirable for the gene product to be secreted from 

25 the host cells, the expression vector is designed to include an appropriate secretion signal 
sequence for secreting the therapeutic gene product from the cell to the extracellular milieu. 
If it is desirable for the gene product to be retained within the host cell, this secretion signal 
sequence is omitted. In a similar manner, the expression vector can be constructed to include 
retention signal sequences for anchoring the prophylactic or therapeutic agent within the host 

30 cell plasma membrane. For example, membrane proteins have hydrophobic transmembrane 
regions that stop translocation of the protein in the membrane and do not allow the protein to 
be secreted. The construction of an expression vector including signal sequences for 
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targeting a gene product to a particular location is deemed to be within the scope of one of 
skill in the art. 

In an embodiment, vectors for mammalian host cell gene therapy are viruses, more 
preferably replication-deficient viruses (described in detail below). Exemplary viral vectors 
5 are derived from: Harvey Sarcoma vims; ROUS Sarcoma virus, MPSV, Moloney murine 
leukemia virus and DNA viruses {e,g., adenovirus). Temin, Gene Transfer 149-187 
(Kucherlapati, R., ed.) (1986). 

Replication-deficient retroviruses are capable of directing synthesis of virion proteins, 
but are incapable of making infectious particles. Accordingly, these genetically altered 

10 retroviral expression vectors have general utility for high-efficiency transduction of genes in 
cultured cells, and specific utility for use in the methods of the present invention. Such 
retroviruses further have utility for the efficient transduction of genes into host cells in vivo. 
Retroviruses have been used extensively for transferring genetic material into cells. Standard 
protocols for producing replication-deficient retroviruses (including the steps of incorporation 

15 of exogenous genetic material into a plasmid, transfection of a packaging cell line with 

plasmid, production of recombinant retroviruses by the packaging cell line, collection of viral 
particles ft-om tissue culture media, and infection of the target cells with the viral particles) 
are provided in Kriegler, Gene Transfer and Expression, A Laboratory Manual 
(Murray, E.L, ed.)(1990). 

20 The major advantage of using retroviruses for gene therapy is that the viruses insert 

the gene encoding the therapeutic agent into the host cell genome, thereby permitting the 
exogenous genetic material to be passed on to the progeny of the cell when it divides. In 
addition, gene promoter sequences in the LTR region have been reported to enhance 
expression of an inserted coding sequence in a variety of cell types. See e.g., Hilberg et aL, 

25 84(15) Proc. Natl. Acad. Scl USA 5232-36 (1987); Holland et al., 84(23) Proc, Natl. 
Acad. Scl USA 8662-66 (1987); Valerio et al., 84(2) Gene 419-27 (1989). In vivo gene 
therapy using replication-deficient retroviral vectors to deliver a therapeutically effective 
amount of a therapeutic agent can be efficacious if the efficiency of transduction is high 
and/or the number of target cells available for transduction is high. 

30 Yet another viral candidate useful as an expression vector for transformation of 

mammalian host cells is the adenovirus, a double-stranded DNA virus. The adenovirus is 
firequently responsible for respiratory tract infections in humans and thus appears to have an 
avidity for the epithelium of the respiratory tract, Straus, The Adenovirus 45 1-496 (H.S. 
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Ginsberg, ed.) (1984). Moreover, the adenovirus is infective in a wide range of cell types, 
including, for example, muscle and epithelial cells. Larrick & Burck, Gene Therapy. 
Application of Molecular Biology 71-104 (1991). The adenovirus also has been used as 
an expression vector in muscle cells in vivo, Quantin et al., 89(7) Proc, Natl, Acad. Sci. 
5 USA 2581(1992). 

Like the retrovirus, the adenovirus genome is adaptable for use as an expression 
vector for gene therapy, i.e., by removing the genetic information that controls production of 
the virus itself. Rosenfeld et al., 252(5004) Science 431-34 (1991). Because the adenovirus 
functions in an extrachromosomal fashion, the recombinant adenovirus does not have the 
1 0 theoretical problem of insertional mutagenesis. 

Thus, as will be apparent to one skilled in the art, a variety of suitable viral expression 

o vectors are available for transferring exogenous genetic material into mammalian host cells. 
The selection of an appropriate expression vector to express an agent for the identification, 
prevention or treatment of microbial infection amenable to gene replacement therapy and the 

J^{l5 optimization of the conditions for insertion of the selected expression vector into the cell are 

"Z^ within the scope of one of skill in the art without the need for undue experimentation. 

s In an alternative embodiment, the expression vector is in the form of a plasmid, which 

is transferred into the target host cells by one of a variety of methods: physical (e.g., 
microinjection (Capecchi, 22(2 Pt 2) Cell 479-88 (1980)), electroporation (Andreason & 

O20 Evans, 6(7) Biotechniques 650-60 (1988)), scrape loading, microparticle bombardment 
(Johnston, 346(6286) Nature 116-11 (1990)) or by cellular uptake as a chemical complex 
(e.g., calcium or strontium co-precipitation, complexation with lipid, complexation with 
ligand) (Methods in Molecular Biology, Vol. 7, Gene Transfer and Expression 
Protocols, (E. J. Murray, ed.) (1991)). Several commercial products are available for 
25 cationic liposome complexation including Lipofectin™ (Life Technologies, Inc., 

Gaithersburg, MD) (Feigner et al., 84(21) Proc. Natl. Acad. Scl USA 7413-17 (1987)) and 
Transfectam™ (ProMega, Madison, Wis.). Behr et al., 86(18) Proc. Natl. Acad. Scl USA 
6982-86 (1989); Loeffler et al., 54(5) J. Neurochem. 1812-15 (1990). However, the 
efficiency of transfection by these methods is highly dependent on the nature of the target cell 
30 and accordingly, the conditions for optimal transfection of nucleic acids into host cells using 
the above-mentioned procedures must be optimized. Such optimization is within the scope of 
one of skill in the art. 
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In an embodiment, the preparation of genetically modified host cells contains an 
amount of cells sufficient to deliver a prophylactically or therapeutically effective dose of a 
substrate for mammalian transglutaminases of the present invention to the recipient in situ. 
The determination of an effective dose of the prophylactic or therapeutic agent for a known 
5 microbial infection is within the scope of one of skill in the art. Thus, in determining the 
effective dose, the skilled artisan would consider the condition of the patient, the severity of 
the condition, as well as the results of clinical studies of the prophylactic or therapeutic agent 
being administered. 

If the genetically modified host cells are not already present in a pharmaceutically 

10 acceptable carrier, they are placed in such a carrier prior to administration to the recipient. 
Such pharmaceutically acceptable carriers include, for example, isotonic saline and other 
buffers as appropriate to the patient and therapy. The genetically modified cells are 
administered by, for example, intraperitoneal injecting or implanting the cells or a graft or 
capsule containing the cells in a host cell-compatible site of the recipient. As used herein, 

15 host cell-compatible site refers to a structure, cavity or fluid of the recipient into which the 
genetically modified cell(s), host cell graft, or encapsulated host cell expression system can 
be implanted, without triggering adverse physiological consequences. Representative host 
cell-compatible sites include, for example, the peritoneal, pleural and pericardial cavities. 
Preferably, the host cell-compatible site communicates with the lymphatic system, thereby 

20 enabling delivery of the therapeutic agent to the vascular system. 

In one embodiment, the host cell-compatible site may be denuded prior to implanting 
the cells. Exemplary denuding methods include but are not limited to: (1) injection of 
distilled water into the site (e.g., the peritoneal cavity) for 20 minutes, followed by scraping 
off a portion of the epithelial layer; (2) injection of 0.1% buffered trypsin for 20 minutes 

25 followed by scraping; (3) removal of epithelial cells by gentle scraping with a cell scraper 
and (4) touching a piece of Gelfilm (Upjohn, Kalamazoo, MI) to the endothelium. 

The genetically modified host cells are implanted in a host cell-compatible site, alone 
or in combination with other genetically modified host cells. Thus, the instant invention 
embraces a method for modifying the epithelial system of a recipient by using a mixture of 

30 genetically modified host cells, such that a first modified cell expresses a first prophylactic or 
therapeutic agent of the present invention and a second modified cell expresses a second 
prophylactic or therapeuric agent. Other genetically modified cell types (e.g., hepatocytes, 
smooth muscle cells, fibroblasts, glial cells, mesothelial cells or keratinocytes) can be added, 
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together with the genetically altered epithelial cells, to produce expression of a complex set 
of introduced genes. Moreover, more than one recombinant gene can be introduced into each 
genetically modified cell on the same or different vectors, thereby allowing the expression of 
multiple prophylactic or therapeutic agents of the present invention by a single cell. 
5 The instant invention further embraces an epithelial cell graft. The graft comprises a 

plurality of the above-described genetically modified cells attached to a support that is 
suitable for implantation into a mammalian recipient, preferably into the oral cavity. The 
support can be formed of a natural or synthetic material. According to another aspect of the 
invention, an encapsulated host cell expression system is provided. The encapsulated system 

10 includes a capsule suitable for implantation into a mammalian recipient and a plurality of the 
above-described genetically modified host cells contained therein. The capsule can be 
formed of a synthetic or naturally-occurring material. The formulation of such capsules is 
known to one of ordinary skill in the art. In contrast to the host cells that are directly 
implanted into the mammalian recipient (i.e., implanted in a manner such that the genetically 

1 5 modified cells are in direct physical contact with the host cell-compatible site), the 

encapsulated cells remain isolated (i.e., not in direct physical contact with the site) following 
implantation. Thus, the encapsulated host cell system is not limited to a capsule including 
genetically-modified non-immortaKzed host cells, but may contain genetically modified 
immortalized host cells, 

20 

Polypeptides 

As used herein, polypeptide refers to a linear series of amino acid residues connected 
to one another by peptide bonds between the alpha-amino groups and carboxy groups of 
adjacent amino acid residues. Additional covalent bonds between portions of the peptide are 

25 also present to restrain the conformation of the molecule, such as amide and disulfide bonds. 
When used herein, protein also refers to a linear series of amino acid residues connected one 
to the other as in a peptide. The term synthetic peptide means a chemically derived chain of 
amino acid residues linked together by peptide bonds that is fi*ee of naturally occurring 
proteins and fi*agments thereof. 

30 Polypeptides of the present invention may include any analog, fragment or chemical 

derivative of the polypeptides capable of inhibiting transglutaminase-mediated microbial 
interaction with a mammalian host. Polypeptides thus may include soluble peptides, Ig-tailed 
fusion peptides, members of random peptide libraries {see, e.g.. Lam et aL, 354(6348) 
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Nature 82-84 (1991); Houghten et al., 354(6348) Nature 84-86 (1991)), combinatorial 
chemistry-derived molecular library made of D-and/or L- configuration amino acids, and 
phosphopeptides (including members of random or partially degenerate, directed 
phosphopeptide libraries, see, e.g., Songyang et al., 72(5) Cell 767-78(1993)). 
5 Such polypeptides may include those derived from mammahan transglutaminases and 

those derived from mammalian transglutaminase substrates. The term analog refers to any 
polypeptide having an amino acid sequence, in comparison to the amino acid sequences of 
the polypeptides of the present invention, in which one or more amino acids have been 
substituted with other amino acids; where the substituted amino acids allow or require the 
10 polypeptide to assume the equilibrium conformation of the domain of the parent protein. 
Often, cysteine, lysine and glutamic acid will be used for their side chains which can form 
covalent linkages to restrict the conformation of a peptide. 

The term analog shall also include any polypeptide which has one or more amino 
acids deleted from or added to an amino acid sequence of a mammalian transglutaminase 
1 5 substrate, but which still retains inhibitory activity against transglutaminase-mediated 

microbial interaction with a mammalian host. The term fragment shall refer to any shorter 
version of the polypeptides, wherein the fragment is capable of inhibiting transglutaminase- 
mediated microbial interaction with a mammalian host. 

Of course, the present polypeptides may also be prepared by recombinant DNA 
20 techniques. The present invention also relates to vectors comprising DNA molecules of the 
present invention, host cells which are genetically engineered with vectors of the invention 
and the production of polypeptides of the invention by recombinant techniques. Host cells 
may be genetically engineered (transduced or transformed or transfected) with the vectors of 
this invention which may be, for example, a cloning vector or an expression vector. The 
25 vector may be, for example, in the form of a plasmid, a viral particle, a phage, etc. The 

engineered host cells can be cultured in conventional nutrient media modified as appropriate 
for activating promoters, selecting transformants or amplifying genes. The culture 
conditions, such as temperature, pH and the like, are preferably those previously used with 
the host cell selected for expression, and will be apparent to the skilled artisan. Mature 
30 proteins can be expressed in mammalian cells, yeast, bacteria, or other cells under the control 
of appropriate promoters. Cell-free translation systems can also be employed to produce such 
proteins using RNAs derived from the DNA constructs of the present invention. Appropriate 
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cloning and expression vectors for use with prokaryotic and eukaryotic hosts are described by 
Sambrook et al., supra. 

The polypeptides of the present invention may be recovered and purified from 
recombinant cell cultures by methods used heretofore, including ammonium sulfate or 
5 ethanol precipitation, acid extraction, anion or cation exchange chromatography, 

phosphocellulose chromatography, hydrophobic interaction chromatography, affinity 
chromatography, hydroxyapatite chromatography and lectin chromatography. Protein 
refolding steps can be used, as necessary, in completing configuration of the mature protein. 
Finally, high performance liquid chromatography (HPLC) can be employed for final 

1 0 purification steps . 

The polypeptides of the present invention may be a naturally purified product, or a 
product of chemical synthetic-procedures, or produced by recombinant techniques firom a 
prokaryotic or eukaryotic host (for example, by bacterial, yeast, higher plant, insect and 
mammalian cells in culture). Depending upon the host employed in a recombinant 

15 production procedure, the polypeptides of the present invention may be glycosylated with 
mammalian or other eukaryotic carbohydrates or may be non-glycosylated. Polypeptides of 
the invention may also include an initial methionine amino acid residue. 

Any peptide of the present invention may be used in the form of a pharmaceutically 
acceptable salt. Suitable acids which are capable of forming salts with the peptides of the 

20 present invention include inorganic acids such as hydrochloric acid, hydrobromic acid, 

perchloric acid, nitric acid, thiocyanic acid, sulfuric acid, phosphoric acid and the like; and 
organic acids such as formic acid, acetic acid, propionic acid, glycolic acid, lactic acid, 
pyruvic acid, oxalic acid, malonic acid, succinic acid, maleic acid, fiimaric acid, anthranilic 
acid, cirmamic acid, naphthalene sulfonic acid, sulfanilic acid, and the like. 

25 Suitable bases capable of forming salts with the peptides of the present invention 

include inorganic bases such as sodium hydroxide, ammonium hydroxide, potassium 
hydroxide and the like; and organic bases such as mono-, di- and tri-alkyl and aryl amines 
(e.g., triethylamine, diisopropyl amine, methyl amine, dimethyl amine and the like) and 
optionally substituted ethanolamines (e.g., ethanolamine, diethanolamine and the like). 

30 For use in a method of identification, prevention or treatment, such as the 

identification, prevention or treatment of infection of a mammalian host by a microorganism, 
the polypeptides of the present invention may be present in a pharmaceutical composition in 
admixture with a pharmaceutically acceptable sterile vehicle. The pharmaceutical 



composition may be compounded according to conventional pharmaceutical formulation 
techniques. 

The vehicle may take a wide variety of forms depending on the form of preparation 
desired for administration, e.g., sublingual, rectal, nasal, oral or parenteral Compositions for 
5 oral dosage form may include any of the usual pharmaceutical media, such as, for example, 
water, oils, alcohols, flavoring agents, preservatives, coloring agents and the Hke in the case 
of oral liquid preparations (e.g., suspensions, elixirs and solutions) or carriers such as 
starches, sugars, diluents, granulating agents, lubricants, binders, disintegrating agents and 
the like in the case of oral solid preparations (e.g., powders, capsules and tablets). Controlled 
10 release forms may also be used. Because of their ease in administration, tablets and capsules 
represent an advantageous oral dosage unit form, in which case solid pharmaceutical carriers 
may be employed. If desired, tablets may be sugar coated or enteric coated by standard 
techniques. 

For compositions to be administered parenterally, the carrier will usually comprise 
1 5 sterile water, although other ingredients to aid solubility or for preservation purposes may be 
included. Injectable suspensions may also be prepared, in which case appropriate liquid 
carriers, suspending agents and the like may be employed. The parenteral routes of 
administration may be intravenous injection, intramuscular injection or subcutaneous 
injection. 

20 For intravenous administration, the polypeptides may be dissolved in an appropriate 

intravenous delivery vehicle containing physiologically compatible substances such as 
sodium chloride, glycine and the like, having a buffered pH compatible with physiologic 
conditions. Such intravenous delivery vehicles are known to those skilled in the art. 

The polypeptides of the invention may be administered to subjects where alteration of 

25 the hypha-specific gene expression is desired. The peptides may be administered by any 
convenient means. Oral administration is presently contemplated as a preferred 
administration route. The amount administered will depend on the activity of the particular 
compound administered, which may readily be determined by those of ordinary skill in the 
art. 

30 

Antibodies 

Antibodies of the present invention may include any polyclonal, monoclonal, 
humanized, anti-idiotypic, chimeric or single chain antibodies, and FAb, F(ab')2 and FAb 
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expression library fragments, and epitope-binding fragments thereof, specific to the 
polypeptides of the present invention. 

Another embodiment of the present invention relates to a monoclonal antibody to the 
polypeptides of the present invention (or an antigenic portion thereof), which may be 
5 produced by methods recognized in the art, including the formation of monoclonal antibody- 
producing hybridomas. Kohler & Milstein, 256(5517) Nature 495-97 (1975); Kohler & 
Milstein 6(7) EuR. J. Immunol. 511-19 (1976). 

In an embodiment of the invention, purified polypeptides of the present invention (or 
an antigenic portion thereof) can be used as an antigen or immunogen. In addition, 
10 microorganisms expressing HWPl promoter regions, protein or polypeptide fragments 
thereof also represent potential antigens or sources of antigen with which to immunize 
□ animals to obtain somatic cells for fusion. Somatic cells with the potential for producing 
ri antibody and, in particular, B lymphocytes, are suitable for fusion with a B-cell myeloma 
n.1 line. Those antibody-producing cells that are in the dividing plasmablast stage fuse 
fjX5 preferentially. Somatic cells may be derived from the lymph nodes, spleens and peripheral 
Jl" blood of primed animals and the lymphatic cells of choice depending to a large extent on 
^ their empirical usefulness in the particular fusion system. Once-primed or hyperimmunized 

animals can be used as a source of antibody-producing lymphocytes. Mouse lymphocytes 
^If give a higher percentage of stable fusions with mouse myeloma lines. However, the use of 
1:320 rat, rabbit, and frog cells is also possible. Alternatively, human somatic cells capable of 
producing antibody, specifically B lymphocytes, are suitable for fusion with myeloma cell 
lines. While B l3miphocytes from biopsied spleens or lymph nodes of individual may be 
used, the more easily accessible peripheral blood B lymphocytes are preferred. The 
lymphocytes may be derived from patients with diagnosed carcinomas. 
25 Specialized myeloma cell lines have been developed from lymphocyte tumors for use 

in hybridoma-producing fusion procedures. Kohler, & Milstein, 6(7) EUR. J. IMMUNOL. 511- 
19 (1976); Schulman et aL, 276(5685) Nature 269-70 (1978). Examples of myeloma cell 
lines that may be used for the production of fused cell hybrids include X63-Ag8, NSI-Ag4/l, 
MPC11-45.6TG1.7, C63-Ag8.653, Sp2/0-Agl4, FO, and S194/5XX0.BU.1, all derived from 
30 mice; 210.RCY3.Agl.2.3, U-226AR, and GM1500GTGAL2, all derived from rats; and U- 
226AR and GM1500GTGAL2, derived from humans, (G.J. Hammerling, U. Hanmierling, 
and J.F. Keamey (editors). Monoclonal Antibodies and T-cell Hybridomas in: J.L. Turk 
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(editor) Research Monographs in Immunology, Vol. 3, Elsevier/North Holland 
Biomedical Press, NY (1981)). 

Methods for generating hybrids of antibody-producing spleen or lymph node cells and 
myeloma cells usually comprise mixing somatic cells with myeloma cells in a 2:1 proportion 
(though the proportion may vary from about 20:1 to about 1:1), respectively, in the presence 
of an agent or agents (chemical or electrical) that promote the fusion of cell membranes. It is 
often preferred that the same species of animal serve as the source of the somatic and 
myeloma cells used in the fusion procedure. Fusion methods have been described by Kohler 
and Milstein (Kohler & Milstein, 256(5517) Nature 495-97 (1975); Kohler & Milstein, 6 
Eur. J. Immunol. 511-19 (1976)), and by Gefter et al. (3 Somatic Cell Genet. 231-36 
(1977)). The fusion-promotion agents used by those investigators were Sendai virus and 
polyethylene glycol (PEG), respectively. 

The monoclonal antibodies of this invention can be used as probes in detecting 
discrete antigens expressed by microorganisms. The expression or lack of expression of 
these antigens can provide clinically exploitable information that is not apparent after 
standard histopathological evaluations. It may thus be possible to correlate the immuno- 
phenotypes of individual microorganisms with various aspects of microbial-mammalian host 
interaction and responsiveness to certain types of therapies, thereby establishing important 
classifications of prognosis. 

The antibodies may also be used to detect drug resistance in microorganisms. For 
example, drug resistant C. albicans can make hyphae (germ tubes) in the presence of drug, 
but susceptible strains cannot. See, Gallagher et aL, 138 J. Gen. Microbiol. 1901-11 (1992). 
Because HWPl is a marker of hyphae formation, the detection of the presence or absence of 
HWPl may be useful in the monitoring of drug resistance in C. albicans. 

The use of the monoclonal antibodies described herein can be extended to the 
screening of human biological fluids for the presence of the specific antigenic determinant 
recognized. In vitro immunoserological evaluation of sera withdrawn firom patients thereby 
permits non-invasive diagnosis of microbial infection. By way of illustration, human fluids, 
such as pleural fluids or lymph, can be taken firom a patient and assayed for the specific 
epitope, either as released antigen or membrane-bound on cells in the sample fluid, using 
monoclonal antibodies against the polypeptides of the present invention in standard 
radioimmunoassays or enzyme-linked immunoassays known in the art or competitive binding 
enzyme-linked immunoassays. 



The monoclonal antibodies of this invention are potentially usefixl for targeting 
microbial infection in wvo. They can therefore be used in humans for localization and 
monitoring of the microbial infection. For this appHcation, it is preferable to use purified 
monoclonal antibodies. Purification of monoclonal antibodies for human administration by 
5 ammonium sulfate or sodium sulfate precipitation followed by dialysis against sahne and 
filtration sterilization has been described. Miller et al., Hybridomas in Cancer Diagnosis 
AND Therapy 134(1982). 

Alternatively, immunoaffmity chromatography techniques may be used to purify the 
monoclonal antibodies. The purified monoclonal antibodies can be labeled with radioactive 
10 compounds, for instance, radioactive iodine, and administered to a patient intravenously. 
After locahzation of the antibodies at the infection site, they can be detected by emission 
tomographical and radionuclear scanning techniques, thereby pinpointing the location of the 
infection. Experimental radioimmunodetection with monoclonal antibodies may occur by 
external scintigraphy. 

15 Passive monoclonal serotherapy may be a potential use for the monoclonal antibodies 

of this invention. By way of illustration, purified anti-HWPl monoclonal antibody is 
dissolved in an appropriate carrier, e.g., saline, with or without human albumin, at an 
appropriate dosage and is administered to a patient. The monoclonal antibodies are 
preferably administered intravenously, e.g., by continuous intravenous infusion over several 

20 hours, as in Miller et al.. Infusions can be administered over a period of weeks during which 
the anti-microbial effects are monitored. 

In an altemate embodiment, the antibodies described herein are used to stimulate the 
production of corresponding anti-idiotypic antibodies. In brief, anti-idiotypic antibodies, or 
antiidiotypes are antibodies directed against the antigen combining region or variable region 

25 (idiotype) of another antibody. Based on Jeme's network model of idiotypic relationships 
(Jeme, 125 Ann. Immunol. 373 (1974); Jeme et al., 1 EMBO 234 (1982)), immunization 
with an antibody molecule expressing a paratope (antigen-combining site) for a given antigen 
should produce a group of anti-antibodies, some of which share with the antigen a 
complementary structure to the paratope. Immunization with a subpopulation of antiidiotypic 

30 antibodies should in tum produce a subpopulation of antiidiotypic antibodies which bind the 
initial antigen. Thus, the administration of the monoclonal antibodies of the present 
invention may result in a modification of the host's anti-microbial immune response, as the 
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consequence of the formation of anti -idiotypic antibodies which may develop during therapy 
with the monoclonals. 

The monoclonal antibodies of this invention can be used in conjunction with a broad 
spectrum of pharmaceutical or cytotoxic agents that selectively affect a microorganism over 
5 the mammalian host. The methods used for binding the cytotoxic agents to the monoclonal 
antibody molecule can involve either non-covalent or covalent linkages. Since non-covalent 
bonds are more likely to be broken before the antibody complex reaches the target site, 
covalent linkages are preferred. For instance, carbodiimide can be used to link carboxy 
groups of the pharmaceutical agent to amino groups of the antibody molecule. Bifunctional 

10 agents such as dialdehydes or imidoesters can be used to link the amino group of a drug to 
amino groups of the antibody molecule. The Schiff base reaction can be used to link drugs to 
antibody molecules. This method involves the periodate oxidation of a drug or cytotoxic 
agent that contains a glycol or hydroxy group, thus forming an aldehyde that is then reacted 
with the antibody molecule. Attachment occurs via formation of a Schiff base with amino 

15 groups of the antibody molecule. Additionally, drugs with reactive sulfhydryl groups have 
been coupled to antibody molecules. 

Another embodiment of the invention relates to a diagnostic kit for detecting a 
microorganism expressing a protein. The diagnostic kit may further comprise, where 
necessary, other components of the signal producing system, including agents for reducing 

20 background interference, control reagents, or an apparatus, container or other solid support 
for conducting the test. The binding of antibody to the target can be detected by well known 
methods, including radiation (e.g., use of a radioactive nucleotide), colorimetry (e.g., use of 
an enzyme that can cause a color change in a substrate), fluorescence (e.g., use of a dye such 
as propidium iodide, fluorescein, or rhodamine), and luminescence (e.g., use of an alkaline 

25 phosphatase substrate that releases photons upon cleavage or luciferin). Detection can be 
qualitative or quantitative. 

The invention has been disclosed broadly and illustrated in reference to representative 
embodiments described above. Those skilled in the art will recognize that various 
modifications can be made to the present invention without departing fi^-om the spirit and 

30 scope thereof. 

Without further elaboration, it is believed that one skilled in the art, using the 
preceding description, can utilize the present invention to the fullest extent. The following 
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examples are illustrative only, and not limiting of the remainder of the disclosure in any way 
whatsoever. 

EXAMPLES 

5 Example 1: Role of HWPl in the formation of candidiasis 
L Structural features of the gene product of HWPl, 

The complete HWPl gene was obtained by screening a C albicans genomic library 
with cDNA which had been isolated following immunoscreening of a germ tube cDNA 
library with hypha-specific antibodies. Staab et al. (1996). 
10 HWPl encodes a protein of 634 amino acids with an N-terminal antigenic domain that 

is more hydrophilic, more acidic, more likely to be present on the protein surface and has a 

0 higher antigenic index than the remainder of HWPL hi contrast to the uniqueness of the 

\| antigenic domain that was previously described, the central and C-terminal regions of HWPl 

1 shares features that have been found in other yeast cell wall proteins, including a high 
Ol5 percentage of serine and threonine residues that leads to extensive O-linked carbohydrate 
rj modification. The two serine/threonine-rich regions (from ILE188 to CYS389, and from 

THR390 to GLU612) are distinguishible by their mole percent of these two amino acids. In 
M the first and second regions respectively, 54% and 30% of the residues are serine or 
threonine. The different content of serine and threonine residues may reflect distinct 
y 20 functions for each region. The second region is also rich in alanine and glutamine (20% of 
residues) and has a higher content of proline residues than the first region. The discovery of 
an abrupt increase in serine and threonine residues at the C-terminal boundary of the 
antigenic domain of HWPl suggests that the "lollipop on a stick" model, in which a C- 
terminal domain rich in hydroxy amino acids serves to extend a binding domain above the 
25 cell surface, may help explain the exposure of the antigenic domain of HWPl at the cell 
surface. Jentoft, 15 Tr. Biochem. Sci. 291-94 (1990). The absence of glycosylation sites 
resulting in a lack of carbohydrate masking in the A^terminal repetitive domain strongly 
suggest that this peptide sequence is important for interactions with host surfaces. 

The first serine/threonine region shows similarity (BLASTP algorithm, Altschul et al., 
30 215 J. MOL. Biol. 403-10 (1990)) to mucins from different species because of small stretches 
of threonine residues in HWPl, while the second serine/threonine region did not show 
similarity to proteins in the databanks. Two 29 amino acid direct repeats separated by 5 
residues were found in this latter region. The amino acid repeats were identical except for a 
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conserved change of alanine to valine at position 22 within the repeat. Li addition to multiple 
sites for O-Unked carbohydrate addition, three potential /V-glycosylation sites were identified 
at amino acid positions 241, 286 and 601, all of which were ASN-Xaa-SER. Finally, the 
hydrophobic C terminus in HWPl is similar to other yeast cell membrane or cell wall 
5 proteins whose maturation involves transfer to a glycosylphosphatidylinositol (GPI) anchor. 

HWPl is acidic, having 63 negative charges at neutral pH, and an isoelectric point of 
3.37, which is expected to contribute to the strong negative charge of surfaces of C. albicans 
hyphae. Following signal peptidase cleavage after amino acid 39 as previously predicted, 
HWPl has a calculated Mr of 61,122. Staab et ah (1996). 
1 0 The complete predicted open reading frame supports several inferences about the 

mechanisms leading to the expression of HWPl on surfaces of C albicans. The hydrophobic 
^7;: N and C termini are consistent with entry into the classical secretory pathway and 
-J modification with a GPI anchor (Englund, 62 Annu. Rev. Biochem. 121-38 (1993)) 
i S respectively, as has been described for other y^asX cell wall proteins in which GPI anchor 
addition precedes covalent linkage of proteins to the cell wall. Lu et al. 14(7) MOL. Cell. 
r; Biol. 4825-33 (1994). 

H II, Creation of mutant strains of C albicans lacking HWPl^ and a revertant that are 
.|j otherwise isogenic with the parental strain. 

20 Two genomic Bam HI fragments containing HWPl half-clones were isolated fi*om a 

genomic library constructed in Lambda GEM- 12 (Promega). Birse et al. The Bam HI 
firagments were cloned in pBluescript SK- to generate pGB8 (1 .0 kb insert) and pGB23 (3.0 
kb insert). An unintemipted HWPl gene was created by ligating the 3.0 kb insert of pGB23 
into pGB8 to create pGBHWPl. An inactive HWPl gene was created by replacing HWPl 

25 DNA between the Bel I and Bgl II sites (394 bp) with the 4.0 kb hisG-URAS-hisG cassette of 
p5921. Fonzi & Irwin, 134 GENETICS 717-28 (1993). The new plasmid containing an 
interrupted HWPl was named pHWPlURA3 (Figure 1). 

Both homologues of HWPl were disrupted (Staab et aL (1999)) using the "URA 
blaster" method adapted from the protocol developed for use with S. cerevisiae (Alani & 

30 Kleckner, 1 16 Genetics 541-5 (1987)) described by Fonzi and Irwin. Fonzi & Irwin, 134 
Genetics 717-28 (1993). Plasmid HWP1URA3 was digested to completion with Hind HI 
which cuts in the polylinker region and in the HWPl coding region to release a 5.3 kb 
fragment harboring the hisG-URA3~hisG disrupted HWPl gene. Five or 10 ^g of digested 
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pHWPlURA3 DNA was used to transform the ura strain CAI4 (Fonzi & Irwin, 134 
Genetics 717-28 (1993)). Postlethwait & Sundstrom, 177 J. Bacteriol. 1772-79 (1995), 
Transformants were screened for the desired recombination events by Southern blot 
analysis. Postlethwait & Sundstrom, 177 J. Bacteriol. 1772-9(1995). Of the twelve Ura"^ 
5 transformants analyzed, three displayed the desired homologous recombination integration 
event at the HWPl locus. One strain, CAH7, was chosen for further manipulations. To 
remove the URA 3 selectable marker, ura auxotrophs of CAH7 were selected on media 
containing 5-fluoroorotic acid (Boeke et ah, 197 MOL. Gen. Genet. 345-6 (1984); Fonzi & 
Irwin, 134 GENETICS 717-28 (1993)) prior to the next round of transformations to reestablish 
10 the Ura^ phenotype. All six Ura^ strains examined had the desired loss of URA3 by 

homologous recombination between the hisG repeats. One strain, CAH7-1, was transformed 
% with the disrupted HWPl cassette, and Ura^ isolates were screened for homologous 

£ integration of the transforming DNA at the intact HWPl locus. Two of nine transformants 
= r: analyzed harbored the desired disrupted HWPl loci, and as expected did not have HWPl on 
Jj^lS hyphal surfaces as detected by indirect immunofluorescence assay (IF A). Staab et al. (1996). 
Cr An Ura" auxotrophic derivative of CAH7-1 A was generated by selection on 5-fluoroorotic 
l^^ medium. 

if An HWPl-\- revertant strain was created by using the ura HWPl strain, CAH7-1A1, as 

a recipient strain for equal amounts (5 jig each) of //mdlll-digested pGBHWPl, and J2?a I 

20 and Xho I-digested p24enura DNA. Postlethwait & Sundstrom, 177 J. Bacteriol. 1772-9 
(1995). Plasmid 24enura harbors an t/7L45-disrupted C albicans enolase gene. Ura+ 
transformants were screened for HWP1+ phenotype by IF A of germ tubes, and one 
transformant, CAHR3, was further analyzed by Southern blotting. CAHR3 had the desired 
homologous recombination insertion at the HWPl::hisG locus, and one of the enolase loci, 

25 was disrupted with URA3, There were no distinguishing phenotypic features between 

CAHR3 and CAH7-1 (Ura+ single disruptant). All of the strains produced germ tubes in 
liquid and in solid media except when dropped on the surface of agar media. Importantly, and 
as expected, the HWP1+ revertant strain (CAHR3) possessed HWPl on hyphal and not yeast 
surfaces as detected by IF A. The immuno fluorescent results show that not only was HWPl 

30 gene replacement successful, but that the developmental hypha-specific regulation was also 
maintained. In other experiments, CAH7-1 A was found to lack //fFP7 mRNA. Staab et al. 
(1999). 
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III. Effect of HWPl on adhesion of germ tubes to human buccal epithelial cells. 

The isogenic HWPl strains created above were used to assess the function of HWPl 
in adhesion to buccal epithelial cells. HWPl allowed germ tubes to form stable attachments 
to BECs that were resistant to treatment with SDS and heat (stabilized adhesion). Stabilized 
5 adhesion was abrogated by known inhibitors of transglutaminases, iodoacetamide and 
monodansylcadaverine. The homozygous HWPl mutant strain, CAH7-1 A, was unable to 
form stable attachments to BECs whereas the revertant strain and the heterozygous mutant, 
CAHR3 and CAH7, respectively, were unaffected in the ability to form stable attachments. 
In biochemical experiments, recombinant HWPl was shown to be a substrate for mammalian 

10 transglutaminase and associate tightly with BECs envelopes after incubations under the 
adhesion assay conditions. Staab et al. (1999). The data points to the abihty of HWPl to 

) mediate stable attachments to human tissue through interactions with host transglutaminases. 



1%- IV. HWPl is important for systemic candidiasis. 

£15 Mice (6/group) were injected intravenously with wild type or mutant strains of C 

f-. albicans, CAH7-1 A, which lacks HWPl, was significantly reduced in the ability to cause 

: lethal candidiasis compared to strains with HWPL 



^1 V. HWPl is important for gastrointestinal candidiasis. 

1:^ 20 Beige nude, Epsilon 26 and beige het mice were raised in the Gnotobiotic Laboratory 

at the University of Wisconsin, Madison and were maintained under germfree conditions. 
Most mice were used at the age of 4 to 7 weeks, however a few pups and older animals were 
also included. Both male and female mice were used. Mice were monoassociated on Day 0 
with C albicans strains CAH7 {HWPl/hwpl\ CAH7-1 A {hwpl/hwpl), CAHR3 (revertant) 

25 and SC5314 (wild type) by swabbing the mouth with a suspension containing 

lO^blastoconidia per/ml. Mice were maintained in sterile isolators for up to seven weeks and 
watched for signs of ill health including lethargy, matted or thinning fur, chills, closed eyes, 
and emaciation that mandated sacrificing. Thus the terms "ill" and "survival" are 
synonymous and are used interchangeably in the description below. Fecal pellets were 

30 collected to verify that animals were colonized and to verify the identity of strains of C. 
albicans that the animals had received. Upon sacrificing, quantitative cultures of cecal 
contents verified the presence of C albicans in individual animals. Quantitative kidney 
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cultures were performed to determine whether C. albicans had translocated across the 
gastrointestinal tract. 

Gnotobiotic beige nude mice that received C. albicans strains with HWPl were more 
prone to failure to thrive than mice that received the C. albicans hwpl null mutant strain 
5 (Figure 2). Only two of seven beige nude mice given the WPi -containing heterozygous C. 
albicans strain and one of four mice given the HWPl revertant remained healthy for six 
weeks. In contrast, only two of 11 mice that received the hwpl null mutant strains of C. 
albicans became ill. The survival difference between groups of mice that received the HWPl 
heterozygote strain CAH7 compared to the hwpl null mutant was statistically significant ( P 
1 0 < 0.05). The combined groups of mice that received the HWPl containing heterozygous and 
revertant, also showed a significant difference in survival firom mice receiving the hwpl null 

0 mutant strain. The absence of a significant difference in the survival of mice that received 
Zi the revertant stram and the hwpl null mutant (P = 0.058) was because one of the mice was 
!y sacrificed for "comparison" prior to becoming ill. When this mouse is removed from the 

01 5 analysis, the survival difference between null and revertant is significant. 

}z: The survival of Epsilon 26 mice also depended on the presence of HWPl in C. 

albicans strains used for monoassociation. All mice that received //fFPi -containing 
M heterozygous C. albicans strains and four of five mice given the HWPl revertant strain 
^! became ill. Only one mouse that received the revertant strain remained healthy for seven 
O20 weeks. In contrast, none of the five mice that received the hwpl null mutant strains of C. 

albicans became ill. The survival differences between groups of mice that received the HWPl 

heterozygote and revertant strains compared to the hwpl null mutant were statistically 

significant ( P < 0.01). 

25 Example 2: Regulation of the HWPl gene 

I. Sequence of the 5' flanking region of HWPl. 

The upstream region of HWPl that is expected to contain the promoter and regulatory 
sequences has been isolated as a 1 .6 kbp Bglil fi-agment of C. albicans genomic DNA. The 
cloned HWPl promoter region includes 1,467 bp upstream of the ATG start codon of the 
30 structural HWPl gene. Genetic organization of the upstream region is shown in Figure 3 
with cloned DNA indicated in the shaded portion. 

Features of the cloned DNA sequence are depicted in SEQ. ID. NO: 1 (Figure 4). A 
10 nucleotide perfect direct repeat was found at positions 554 and 567 and a putative TATA 
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box was found 67 bp upstream from the begimiing of the transcript. Searches for potential 
binding sites in fungal matrices using Matlnspector v2.2 revealed the presence of sequences 
for NIT2 (16 sites), PH04 (2 sites), MATAl (1 site), and HSF (1 site) binding sites within 
the 1 ,467 nucleotide sequence. A *S. cerevisiae cAMP response element (CRE) site was also 
5 found within a PH04 site. The presence of NIT2 and HSF sites are particularly interesting as 
there is evidence for nitrogen regulation and temperature regulation of germ tube formation 
in C. albicans 

The predicted position for the TATA box is shown. A notable feature is a pair of 
NIT2 sites in opposite orientations near position 800 that form an almost perfect palindrome 

10 on opposite DNA strands. 

Matches to UAS's or URS's were not found in the 613 bp figure between the left 
Bglll site and the 5' neighboring gene's stop codon. APL6 (homologue of the human AP-3 
complex unit was mapped to this locus by the C. albicans Sequencing Project at Stanford 
University). These findings indicate that all of the necessary cis elements needed for 

1 5 developmental regulation are found within the 1 ,467 bp upstream of HWPl. 

The numerous sites for NIT2 binding is significant because the NIT2 protein of 
Neurospora crassa is a global positive-acting transcription factor of nitrogen structural genes 
when preferred nitrogen sources are lacking. A NIT2-like protein likely may regulate 
virulence genes in C albicans. 

20 A search of the C albicans genome revealed that GAT99 (GAT-1 or GATA like protein) 

is highly homologous to NIT2. The GAT99 predicted amino acid sequence was also 
homologous to AREA, NRE and GLN-3. Referring to amino acid identity of GAT99 [SEQ. 
ID/ NO: 2] to the 50-residue DNA binding domain in NIT2 [SEQ. ID. NO: 3] was 92% by 
the ALIGN program and is shown in Figure 5. This region is highly conserved in GATA 

25 factors. Finding GAT99 in C. albicans suggests that regulation of HWPl expression may be 
tied to nitrogen regulatory circuits. In fungi, NIT2 sites in promoter regions vary in number, 
location, and orientation; strong NIT2 binding sites are usually found as two or more 
elements located within 30 bp of each other in the same or opposite orientation. There are 
three such pairs of NIT2 sites in the 1467 bp upstream region of HWPl, one set starting at bp 

30 560, the second at bp 800 and the third at bp 1246. Since mutations in any of the nucleotides 
in the core sequence greatly reduce or eliminate NIT2 binding these NIT2 elements will be a 
focus of promoter mapping using deletion analysis and site-directed mutagenesis. 
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II. Sequence of the 3' flanking region oiHWPl. 

1,441 nucleotides downstream of HWPl have also been cloned and sequenced. The end 
of transcription was determined by 3' RACE PGR using an i7fFP7-specific oligonucleotide 
corresponding to residues 691-710. The 3' DNA sequence of the RACE PCR product 
5 showed that translation of HWPl terminated with a UAA codon followed by 277 nucleotides 
of untranslatead message. A polyadenylation site (AAUAAA) 252 nucleotides from the stop 
codon, and a poly A tail of 14 adenosines were also part of the 3' region of the message. The 
polyadenylation site, and other sequences potentially important for 3' end formation are 
shown in bold. 

10 Referrring to Figure 6 [SEQ. ID. NO: 4], a truncated open reading frame (bold-italics) 

consisting of 178 amino acids, beginning with an ATG and interrupted by the Bam HI site at 
the end of the clone (underlined), exists at the 3' end of this region indicating the presence of 
an additional gene. The homologue of RAD2 (excision repair gene) was mapped to this open 
reading frame by the C albicans Sequencing Project at Stanford University. Studies of the 

15 open reading frame showed that this gene is not differentially regulated (mRNA is present in 
both yeast and hyphal forms). 

III. Kinetics of Expression of HWPl mRNA. 

Initial studies showed the presence of an abundant HWPl signal on Northern blots 
20 three hours after initiation of mass conversion from yeasts to hyphae (strain SC53 14) in 

Ml 99. To gain more information about the kinetics of expression of HWPl, Northern blots 
were performed on cells placed in prewarmed Ml 99, Lee's pH 6.5 medium, and Ml 99 with 
5% fetal calf serum beginning at 10 minutes and continuing for 7 hours (Figure 7). Under 
these conditions, close to 100% of the cells form germ tubes; no budding yeasts were 
25 observable. Each lane contains 2.5 ug of total RNA. The presence of nearly equivalent 
amounts of RNA in all lanes was verified by using anl8S ribosomal RNA probe which 
controls for sample amounts in lanes. Oligonucleotide primers representing nucleotides 314 
to 332 and complementary to 967 to 986 of 18S rRNA gene were used to amplify a 672 bp 
fragment by PCR. The PCR product was radiolabeled and used to probe stripped Northern 
30 blots. The blots were first probed with HWPl cDNA which represents the 5' region of the 
structural gene that encodes the proline and glutamine-rich repetitive N-terminal domain 
HWPl. 



-56- 



Referring to Figure 7, the blot has been overexposed to show that HWPl mRNA is 
detectable within 20 minutes and at 7 hrs of placing cells under induction conditions {HWPl 
mRNA is clearly visible in the autoradiograph but is faint in the reproduction above). The 
message level increases dramatically between 20 and 30 minutes and continues to rise, 
5 peaking at 3-4 hr. After four hours, the HWPl mRNA begins to decline until 7 hr, where it 
stabilizes at a low level. 

To correlate the presence of HWPl mRNA with protein (HWPl) and germ tube 
formation, indirect immunofluorescence experiments were performed using monospecific 
antiserum to recombinant HWPl (rHWPl). Germ tube production and surface HWPl 

10 production are synchronized, both appearing at 45 min on approximately 25% of yeasts and 
are seen on the majority of cells by 60 min (not shown). Messenger RNA precedes the 
presence of visible germ tubes and surface HWPl by 20-25 minutes. 

The kinetics of HWPl expression appears to differ quantitatively from mRNA for 
house keeping genes, but the overall rise and fall of message levels is similar. HWPl 

1 5 message is evident nearly an hour earlier than ENO mRNA, a gene which encodes an 

abundant glycolytic enzyme, enolase, that is present in yeast and hyphal growth. However, 
both mRNA's peak at 3-4 hours and drop thereafter. Similar results were found for other 
non-developmentally regulated gene, ACTl and carboxypeptidase Y. These results show that 
expression of HWPl is produced earlier and to higher levels than housekeeping genes, 

20 reflecting the presence of activation mechanisms not found for housekeeping genes or at least 
in addition to those for housekeeping genes. 

The results show that repressive mechanisms serve to shut off HWPl during yeast 
growth, through interactions of HWPl upstream sequences and DNA-binding proteins that 
interact with TUPl or other general repressor proteins. The results also show that activating 

25 factors may serve to increase HWPl mRNA levels above those for other abundant proteins 
such as enolase and actin during germ tube growth. 

Example 3: Creation of an HWPl :GFP reporter fusion for monitoring the HWPl 
promoter 

30 HWP1:G¥F reporter fiision was created for monitoring the HWPl promoter and for 

verifying that the appearance of GFP accurately reflects the activity of the native HWPl 
promoter. 
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The strategy was to target a transcriptional fusion between HWPl flanking regions 
and a reporter gene (GFP) to the genomic enolase locus. The reason for inserting the reporter 
into the genome is to avoid artifacts associated with overexpression and problems with 
plasmid instability that are inherent in the use of C albicans ARS (CARS) plasmids. 
5 Unfortunately, plasmids with features such as S, cerevisiae CEN plasmids which have 
centromeric sequences that confer stability and low copy number are not available for C 
albicans. 

The choice to use the ENO genomic locus was made for several reasons, the first 
being convenience. A construct (p24enura) having the C albicans enolase gene disrupted 

1 0 with URA3 (58) was used. A second reason is that the effect of gene disruption at the ENO 
locus of C. albicans is known. Prehminary work has shown that disruption of one of the C. 
albicans enolase genes does not adversely affect growth, morphogenesis (Postlethwait & 
Sundstrom, 177 J. Bacteriol, 1772-9 (1995); Staab et al. (1999)) or pathogenesis in a 
murine model of systemic candidiasis. The virulent HWPl revertant in the systemic and 

15 mucosal candidiasis studies was prepared using co-transformation with p24enura. Staab et al. 
(1999). Studies with UnoPP-1 ((Postlethwait & Sundstrom, 177 J. Bacteriol. 1772-9 
(1995)) (and the HWPl revertant (Staab et al. (1999)) showed that enolase gene disruption 
also does not affect the frequency of germ tube formation, the length of germ tubes that are 
formed or expression of HWPl on germ tube surfaces. 

20 In principle, use of the ENO locus is similar to use of the ADE2 locus for studies of 

promoters of genes involved in switching (Srikantha et al.) and, thus, a precedent has been set 
for studying promoter activity at ectopic loci. ADE2 and ENO are both constitutively 
expressed, housekeeping genes. There is no known reason why the ENO locus would be 
more cumbersome to work with than ADE2, The availability of the ADE2 locus does serve 

25 as a fallback gene should problems with the ENO locus arise. The ADE2 gene is cloned and 
could new constructs utilizing ADE2 must be made. 

The HWPl locus is not particularly useful for this work because it is necessary that 
promoter sequences that are manipulated and introduced into the genome are studied. If the 
construct were introduced at the HWPl locus, native promoter sequences would probably 

30 confound the results. The experiments depend on being able to delete and add back portions 
of the region upstream of the promoter. This requires use of an independently rephcating, 
stable plasmid which is not available for C albicans, or ectopic expression. 
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I. Plasmid construction. 

To be able to easily assess the role ofHWPl flanking sequences in gene expression, 
the coding region of HWPl was replaced with that of the green fluorescent protein (GFP) 
which has been optimized for expression in C. albicans by Cormack. Cormack et al. The 
transformation recipient is the ura auxotroph CAM, The presence of a functional HWPI 
gene in this strain are used to correlate the expression of GFP message with HWPl message, 
confirming the utiUty of the reporter strain as accurately reporting HWPI expression. 

The features of the plasmid that was constructed according to the scheme in Figure 8 

are: 

1) 1,467 bp of HWPl upstream flanking sequences; 

2) 352 bp of downstream flanking sequences; 

3) the C. albicans enolase gene disrupted with the C albicans 
URA3 gene for selection in the ura3 strain CAM, and for 
targeting to of one of the ENO gene loci of C albicans; and 

4) a plasmid lacking HWPl upstream sequences is used as a 
negative control. 

II. Transformation of C albicans and comparison of GFP expression to expression of 
the native chromosomal HWPl gene. 

Plasmid HWPIGFVI was digested with Cla I (arrow; see Figure 9) which cuts at a 
single site in the C albicans enolase gene to target the construct to the ENO locus in the 
genome using spheroplast transformation. This resulted in a mutated copy of the enolase gene 
disrupted with URA3 adjacent to a wild type copy as diagrammed below. Transformants 
were selected on media lacking uridine. A control plasmid lacking HWPl upstream regions 
but otherwise identical to pHWPlGFPl (pGFPl) served as a negative control. 

Transformants were first screened by Southern blotting to verify integration at the 
enolase locus. The Southern blotting experiments were also used to determine the copy 
number of pHWPlGFPl DNA inserted into the genome, since it is possible that more than 
one copy of DNA will become inserted. The copy number of inserted plasmids were 
determined by comparing hybridization intensities of DNA from transformants with a 
standard curve of GFP DNA to identify transformants with a single copy of GFP DNA. This 



procedure was applied in all the following experiments to ensure that strains with single 
integrations will be compared for GFP levels. 

GFP expression under the control of the HWPl promoter had been achieved. GFP 
was detectable in cells making germ tubes and was not detectable in yeast form cells. The 
5 result showed that the HWPl flanking sequences present in the plasmid not only permit 
expression of GFP, but that they also confer hypha-specific regulation to GFP, confirming 
that the sequences in the plasmid do have controlling elements. Photomicrographs showing 
GFP expression in germ tube forming cells that were taken with an epifluorescence 
microscope and light photomicrographs of the same field showing that only cells with germ 

10 tubes fluoresced. 

An important point that was elucidated by these unstable constructs is that position 
effects are not likely to play a major role in expression of HWPL Since integration into the 
chromosome was not achieved, developmental control was mediated solely by plasmid 
sequences. Additional evidence for the lack of a role for position effects is the absence of 

1 5 developmental regulation of the RAD2 gene which is just downstream of the HWPl 

Stable integration of the //^Pipromoter: GFP construct at the genomic enolase locus 
have also been achieved. Linearization of the construct at the Cla I site in the enolase coding 
region followed by transformation of CAM with 10 |ag of DNA produced four transformants. 
Two of these (GFP1/C2 and GFP1/C3) were stable, retaining the plasmid in the absence of 

20 selection. Southern blotting of genomic DNA digested with Aval confirmed that both 
transformants had integrated GFP into the ENO locus of C albicans (Figure 10). The 
integrated construct produced a 7.8 kb Aval band that was recognized by both enolase and 
GFP probes. The undisrupted enolase locus produces a 6.4 kb band (Figure 10, very faint 
band in lane 4, see arrow). As expected, GFP sequences are not found in the parental strain, 

25 CAM (Figure 1 0, lane 6). 

Several growth conditions were employed to show that the HWPl promoter: GFP 
construct is developmentally regulated. GFP1/C2 and GFP1/C3 transformants were grown to 
stationary phase in YNB (yeast nitrogen base with 50 mM glucose) for 48 hrs at room 
temperature before placement in prewarmed (37**) Ml 99, fresh YNB at 27"*, or in the four 

30 Lee's media conditions (pH 4.5 and 6.8 at 27"* and 37'') with gentle shaking at the appropriate 
temperature for 3 hrs. The cells were examined for the production of GFP in germ tubes 
incubated in Ml 99 and Lee's pH 6.8 at 37*". As expected, fluorescence was not localized to 
germ tubes but was found throughout the cytoplasm of C albicans cells because GFP does 
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not have a signal sequence or other features that mediate secretion and cell wall expression 
that are present in HWPL In addition, the appearance of fluorescence coincided with the 
induction of germ tube formation (Figure 11, Panel A). Yeast (Figure 11, Panel B) and 
pseudohyphae (Figure 11, Panel C) forms did not express GFP. 
5 Given the lack of GFP expression in yeast forms, it is unlikely that the GFP construct 

will fluoresce without the flanking sequences. However, as an absolute control for 
background fluorescence, a plasmid lacking upstream sequences is prepared. Fluorescence in 
expressing strains are compared to strains transformed with the GFP-ENO-URA3 portion of 
the plasmid localized to the ENO locus. DIG differential interference microscopy and the 

10 ability to merge DIG images with fluorescent images are essential for these experiments. 
To show that GFP does not interfere with HWPl expression, indirect 
immunofluorescence assays were performed to detect surface HWPl using GFP-producing 
transformants (Figure 12). Secondary anti-rabbit antibodies conjugated with Texas Red 
rather than FITC were used, so that HWPl could be detected in the presence of GFP. No 

15 differences were noted in the appearance of HWPl on GFP-producing cells compared to wild 
type cells and parent strains. 

The lack of inhibition of HWPl expression by the GFP construct is fundamentally 
important in that it shows that artifactual influences on mechanisms regulating HWPl 
expression will not occur. 

20 Fluorescence microscopy will allow rapid, qualitative assessment of GFP production 

but will not permit quantitative measurements. Flow cytometry is a straightforward and 
simple method to determine fluorescence levels . The number of cells and their relative 
fluorescence are determined by flow cytometry after analysis of 10,000 events. The amount 
of fluorescence for a defined nxunber of individual cells (the actual number of cells necessary 

25 will be determined in preliminary experiments) is quantified using an image analysis system. 
An example of flow cytometry data used to analyze the stable transformants is shown in 
Figure 13, The graph in Figure 13 (Panel B) shows a three log shift in fluorescence of germ 
tubes; cells (yeasts) that did not germinate (approximately 20% of the cells) are non- 
fluorescent (Figure 13, Panel A). 

30 Fluorometry using glass bead cell extracts and a fluorometer is also be used to 

quantitate GFP fluorescence. The emission at 505 nm is read with excitation at 488 nm. The 
advantage of fluorometry is the ability to measure fluorescence for a given amount of protein 
extract which permits detection of subtle changes in fluorescence that may not be apparent by 
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flow cytometry. Both methods are routinely used to monitor GFP expression and the results 
from the two methods are expected to be comparable. 

To further ensure that GFP expression reflects HWPl expression, the kinetics of 
production of GFP mRNA was correlated with the kinetics of expression of HWPl mRNA. 

5 HWPl mRNA appears within 20 minutes of placing yeasts in germ tube induction conditions. 
Northern blotting with the reporter strain is also performed, and GFP mRNA is detected by 
hybridization and correlated with the presence of HWPl mRNA. Ml 99 is used at 3TC 
without serum in these experiments the kinetics of induction are independent of the 
composition of the media. GFP fluorescence is expected to coincide with the appearance of 

1 0 HWPl mRNA and precede the emergence of germ tubes. 

The kinetics of the disappearance of GFP mRNA is correlated with the disappearance 
ofHWPl mRNA when cells are transferred from induction to repressing conditions. Two 
methods for repression are employed. In the first method, cultures of 100% germ tubes that 
have been growing as germ tubes for 2.5 hours are rapidly subjected to a 10-fold reduction in 

15 culture volume by filtration (/.e., a 1 liter culture will be reduced to 100 ml by filtration). A 
second method involves dropping the temperature by filtering the 2.5-hour germ tube culture 
and resuspension in Ml 99 at 30°C. hi both cases, transferred organisms in repressing 
conditions are removed at 5 min intervals and mRNA is isolated followed by Northem 
blotting using probes for HWPl and GFP mRNA. The morphology of the cells is monitored 

20 by microscopy and the appearance of cells is documented by photomicroscopy using a CCD 
camera. Thus the kinetics of GFP expression is correlated with HWPl mRNA expression 
during induction and repression of germ tube formation. 

Example 4: Identification of putative UAS and URS elements that regulate HWPl 
25 expression. 

L Deletion analyses. 

The locations of cz5-acting regulatory sequences upstream of HWPl is determined by 
performing the reporter experiments using plasmids which contain deletions in HWPl 
upstream sequences. The location of UAS's is detected by the correlating the loss of GFP 
30 expression with the deleted region. Constructs lacking 5' sequences are also be assessed. 

The initial analysis of the HWPl promoter region is to identify regions that affect 
expression by deletion analysis. DNA fragments having less and less 5^ DNA but the same 3' 
sequence for fusion to the open reading frame of GFP were created (Figure 14). Sequential 
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deletions of DNA are be performed by PGR with oligonucleotides to sequences progressively 
closer to the -1 site [SEQ. ED. NOS: 5-11]. The same 3^ ohgo (Figure 14) used to generate 
the original plasmid construct (pHWPlGFPl) are used. The same plasmid (pBSBglIIl.8) 
with 5' HWPI genomic DNA is used as template for PGR. DNA sequences specifying a 
5 Hind III site on the 3' oligonucleotide and Xho I sites on the 5 ' oligonucleotides resulted in 
the addition of Hind III and Xho I sites on the 3' and 5' ends of the PGR products, 
respectively. 

The strategy described above for constructing deletions required the removal of the 
Hind III site in enolase by site-directed mutagenesis using the Promega GeneEditor in vitro 

10 Site-Directed Mutagenesis System (Madison, WI). This high efficiency system allowed for 
the generation and selection of oligonucleotide-directed mutations. The method takes 
advantage of an E, coli strain mutated at the MutS gene (DNA repair) which permits recovery 
of plasmid DNA without selection against the desired mutation. The system also allows for 
generating simuhaneous mutations using one template, a feature that will become usefiil to 

15 study the effects of various mutations in a single construct on GFP transcription. 

A mutagenic ohgonucleotide that mutates the Hindlll site in enolase is generated. 
The loss of one Hindlll site is used to screen potential mutants, and the desired base change 
in enolase is verified by DNA sequencing. The resulting plasmid is named pHWPlGFP2, 
and is used as pHWPlGFPl in transformations of C albicans. The loss of the Hindlll site in 

20 enolase should not affect the developmental regulation of GFP observed with the original 
pHWPlGFPl construct since enolase sequences are being used to target integration in the 
chromosome and are not driving GFP transcription. Before the PGR products are cloned into 
pHWPlGFP2, the original 1.46 kb 5' HWPI fragment is separated from the rest of the 
construct by gel electrophoresis. The gel-purified construct minus the 1.46 kb fragment is 

25 then be used as the vector for cloning the various PGR-generated deletions. 

The linearized plasmid with Xhol and Hindlll ends is also used to generate a 
promoterless construct. The 5* overhang ends produced by Xhol and HindlH is blunted by 
using Klenow DNA polymerase fragment in the presence of dNTPs. Self-Mgation of the 
blunt-ended plasmid produces a construct without a promoter that is named pGFPl. The 

30 oligonucleotides for generating the first set of deletions are shown in Figure 14 [SEQ. ID. 
NOS: 5-11]. 
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Growth Conditions: To identify UAS's, and URS's three independent, single copy, 
transformants for each construct are analyzed in conditions that induce the appropriate form 
of growth. The conditions are: 
For germ tubes; 
5 1. M199at37°C* 

2. M199 with 5% serum at 37^C* 

3. Lee's medium at 37C, pH 6.8* 
^Clarified, whole human saliva at ST'^C 



10 For yeasts; 

1. YNBat37°C* 

2. Lee's media, 37^C,pH4.5* 

3. YNBat25°C* 

*Both liquid cultures and agar plate media will be analyzed. 

1 5 UAS's important for HWPl expression are identified by a decrease in expression of 

GFP during germ tube inducing conditions compared to the strains with the intact HWPl 
promoter region. URS's important for repression of HWPl are identified by de novo 
expression of GFP during yeast growth. Liquid cultures are analyzed by flow cytometry and 
fluorometry. For fluorometry, GFP fluorescence in extracts is determined relative to the 

20 amount of protein in extracts. The average values for three transformants is used. A given 
deletion is considered to harbor a UAS element if its deletion leads to at least 30% reduction 
in GFP levels compared to fluorescence of transformants with the intact construct, and to 
harbor a URS element if its deletion leads to at least a 3-fold enhanced expression during 
growth as yeasts. Fluorescence of solid cultures is assessed qualitatively using the 

25 fluorescence microscope. 

II. Fine structure mapping and site-directed mutagenesis of DNA segments that are 
predicted to be functionally important. 

Deletion analysis provides strong preliminary information about the location of 
30 regulatory elements within approximately 200 bp. This section performs fine structure 
mapping of the regions deemed important, as set forth in section L For example, if the 
experiments note a loss in regulatory elements in the 200 bp segment upstream of the TATA 
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element, then the specific location of the regulatory element is determined by the combined 
results of several approaches: 

1) . The smallest construct containing the elements is analyzed by deletion analyses 
similar to those described above, but by constructing a new deletion series within the region 

5 of question to generate a series of overlapping deletions differing by 50 bp in size. 

2) . Site-directed mutagenesis using the Promega GeneEditor in vitro Site-Directed 
Mutagenesis System is used for generation and selection of oUgonucleotide-directed 
mutations within the 200 base pair region. The intact HWPl promoter region:GFP construct 
is used as the template for mutagenesis and the exact sequences to be mutated will depend on 

10 the actual sequence and the results of the fine structure deletion mapping above. 

III. Impact of regulatory elements on minimal promoter 

The most deleted HWP:GFP construct retains the TATA box and is predicted to bind 
basal transcription factors. Addition of regulatory elements to this putative "minimal 
15 promoter" may lead to an increase or decrease in basal promoter activity. The effect of DNA 
with UAS or URS elements on this highly deleted construct is more easily predicted once the 
minimal GFP fluorescence from this construct is determined. 

Example 5: Characterization of DNA binding proteins (DNABP) that regulate the 
20 HWPl promoter. 

I. Electrophoretic mobility shift experiments. 

cz5-acting regulatory sequences that are the most potent regulators of GFP expression 
are used to identify and characterize binding proteins using electrophoretic gel mobility assay 

25 (EMSA) experiments. Protein extracts containing the putative DNA binding factors are 

prepared from germ tubes and/or yeasts grown under conditions where regulatory activity is 
present based on the experiments using the media described above. Protein extracts are 
pelleted as ammonium sulfate precipitates. Covitz & Mitchell, 7(8) GENES DEV. 1598-608 
(1993). Cells are washed, lysed with glass beads in buffer with protease inhibitors, clarified 

30 by low speed centrifugation followed by ultracentrifugation to remove insoluble material. 

DNA is removed from the protein extracts by precipitation with polyethyleneimine followed 
by centrifugation. Proteins in the supernatant are precipitated with soUd ammonium sulfate. 
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followed by collection of the pellet by centrifugation. Extracts from hyphal and yeast growth 
forms are prepared using multiple media conditions described below. 

The EMS A method of Chodosh is used. Chodosh, CURRENT PROTOCOLS IN 
MOLECULAR Biology (Ausubel et al., eds.) (1988). Fragments exhibiting the strongest UAS 
5 and URS activities, as determined above, are used as probes. Double-stranded 

32 

oligonucleotides are prepared from these regions and are end labeled with [y- ]ATP. 
Extracts and probes are incubated in binding buffer containing glycerol, and poly(dI:dC) 
prior to separation by electrophoresis with a high-ionic strength 4% polyacrylamide gel. Gels 
are dried and examined by autoradiography. 

10 The sizes of retarded complexes are compared to the mobility of the labeled DNA 

fragment in the absence of added proteins. The number of complexes, and their correlation 
with yeast or hyphal morphology is assessed. To verify the specificity of the complexes, cold 
probe present in 50 fold excess is added as a competitor in the incubations. If the shifted 
bands are specific for the labeled probe, the bands should disappear in the presence of non- 

1 5 radioactive competitor probes. 

The gel mobility shift experiments provide insight into the regulation of HWPl 
expression. There may be proteins in DNA-protein complexes that are specifically present 
when hyphal-extracts but not yeast extracts are used. If repression during yeast growth is 
important, yeast-specific complexes will be found. Alternatively, there may be complexes 

20 that are present when extracts from either yeasts or hyphae are used, or there may be both 

morphology-specific as well as non-specific complexes. Growth phase-specific complexes if 
present will be the focus of fixture studies. However, the absence of growth phase-specific 
complexes is also of interest since hypha-specific expression might be a result of differential 
activation of upstream binding proteins through protein modification such as 

25 phosphorylation, or there might be additional protein-protein interactions that determine 
specificity. 

II. Determining sizes of DNA binding proteins. 

Determining the size of the protein(s) that form complexes with HWPl upstream 
30 DNA is necessary for subsequent efforts to clone the corresponding gene(s) from expression 
libraries. To determine the sizes of proteins complexed to HWPl upstream DNA, complexes 
are crosslinked using UV light, followed by elution from the non-denaturing gel, 
solubilization in Laemmli-SDS sample buffer and separation on SDS-PAGE gels, followed 
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by autoradiography as previously described by Chodosh. The sizes of the proteins are 
determined using a standard curve of proteins of known molecular weights. 

III. Cloning genes encoding DNA binding proteins. 

5 After the detection and preliminary characterization of proteins that bind to the 

upstream region of HWP], the genes for the binding protein of interest are cloned. Having 
the primary sequence for proteins that bind to the upstream promoter region of HWPI 
provides information for future investigations into the mechanisms of morphology-specific 
gene-expression of HWPL 

10 DNA affinity chromatography is the method of choice for isolation of DNABPs. The 

UAS and URS elements identified in previous sections are used to isolate the desired 
regulatory factors in native form firom nuclear extracts of C albicans cells. Primers that bind 
to DNA in the HWPI promoter region template that surround putative UAS and URS 
elements of interest are amplified by PCR to generate products 80-100 nucleotides in size. 

1 5 Primers for PCR reactions are synthesized with a 5' biotin group. PCR products are 
separated on and excised from agarose gels. The biotinylated PCR products are then 
immobilized on Streptavidin-conjugated Dynabeads as described by Gabrielsen and Huet 
(Gabrielsen & Huet, 218 METHODS Enzymol. 508-25 (1993)) followed by washing to 
remove imbound DNA. 

20 To purify DNABPs, nuclear extracts from cells grown in appropriate media are 

incubated with the DNA-beads in binding buffer containing poly (dl-dC) and washed, 
followed by elution of the DNA-binding protein with high salt, and then desalting. Myokai et 
al., 96(8) Proc. Natl. Acad. Scl USA 4518-23 (1999). To obtain a nucleic acid probe for 
screening cDNA and genomic libraries, purified DNA-binding proteins are subjected to N- 

25 terminal amino acid sequencing and/or sequencing of peptide fi-agments that were purified by 
HPLC. 

Degenerate DNA sequences derived from the amino acid sequences are used to search 
the C albicans genome which is now complete to 5X coverage. Once the DNABPG is 
identified by the initial search, PCR primers representative of the exact sequence in the 
30 genome are designed and used to ampHfy the gene using genomic DNA as a template. A 
possible problem is that the degenerate sequence may or may not successfully locate the 
corresponding genomic DNA in a computer search of the genome depending on the degree of 
degeneracy. If available, the sequenced peptides of the isolated DNABPs are readily 
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identified and this will point to the exact DNA sequences needed for primer design. If the 
proteome is not available, and if the degenerate DNA sequences do not successfully locate 
the DNABPG, other methods will be used to isolate the corresponding gene. Degenerate 
oligonucleotide primers are designed to use in PGR to amplify a region of the DNABPG 
5 using genomic DNA as template to generate a probe for library screening. The DNA 

sequence of the PGR product is analyzed to verify the authenticity of the product as encoding 
the protein fragment. Library screening follows standard techniques that have been used to 
isolate genomic clones of other genes by those skilled in the art. Subclones in pBluescript® 
of large lambda clones are obtained followed by DNA sequencing to verify the identity of the 
10 cloned DNA as being representative of the PGR product and peptide sequences derived from 
the DNABP. 

IV. cDNA library screening using UAS or URS elements as probes. 

An alternative approach is to screen a C. albicans cDNA expression library with the 

15 HWPl upstream sequences identified in the reporter gene experiments and gel mobility shift 
experiments described above. Gurrently-available cDNA libraries consist of cDNA prepared 
from mRNA isolated from germ tubes grovra in Ml 99. A yeast cDNA library is prepared, if 
necessary as determined by experiments or delineated above, to clone genes for proteins that 
bind to URS's using mRNA from yeasts in logarithmic growth. Because lambda plaques 

20 generally contain small amounts of fusion proteins in native conformation, the screening 
sensitivity is enhanced by concatenating the probe so as to generate DNA fragments with 
multiple copies of the binding sites. Plaques that hybridize with the probe (critical cf^-acting 
sequences from previous experiments), but not with a control DNA probe are purified, and 
inserts are sequenced and analyzed for open reading frames. 

25 To verify that genomic clones encode DNABPs of the expected activities, 

recombinant DNABPs is produced in E, coli. Very few C albicans genes have introns, but if 
introns are present (as detected by the presence of conserved splice sites and stop codons 
within the coding region) cDNA clones are obtained by screening the appropriate cDNA 
library prior to production of recombinant proteins. The cDNA clones are used for 

30 production of recombinant proteins in E. coli as is routinely done by those skilled in the art 
using either pGEX or his-tag expression systems, and purified by affinity chromatography. If 
production of recombinant proteins in E, coli is not possible due to codon bias, the yeast 
Pichia pastoris (Livitrogen) will be used. Various recombinant HWPl proteins expressed in 
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p. pastoris, and vectors that fuse "his'' tags to recombinant proteins are now available. The 
recombinant proteins are assayed by gel mobility shift experiments using the previously 
identified c/^-acting sequences as probes, as well as SDS-PAGE analysis to compare the size 
of the purified protein to that initially found in crude extracts, procedures that provides 
5 confidence that the correct proteins that bind C albicans UAS and/or URS sequences are 
being studied. Purity is assessed by the demonstration of a single spot on two dimensional 
electrophoresis. 

V. Characterization of DNA binding proteins by computer analysis of the predicted 

1 0 open reading frame. 

cDNA clones isolated from library screening are used to screen a genomic library of 
C. albicans. Genomic clones containing proteins that bind regulatory sequences ofHWPl 
are completely sequenced on both strands. The BLAST program is used to search the 
database through the National Center for Biotechnology Information to identify homologues 

1 5 in other organisms. The predicted protein sequences are extensively analyzed for the 

presence of hydrophobic segments, clustering of acidic and basic residues, tyrosines and 
phenylalanines, protein kinase target sites, nuclear localization, zinc finger, helix-tum-helix, 
homeodomain motifs and leucine zippers that are known to exist in other DNABPs, as well as 
for the presence of other amino acid motifs that specify functional attributes of proteins as 

20 well as for secondary structure. 

IV. Localization of DNA binding proteins. 

DNABPs exert their effects after localizing to the nucleus. The presence of protein in 
the nucleus should correlate with the growth stage-specific binding determined in the gel 

25 mobility shift assays. To monitor expression of the DNABP in experiments described below, 
DNA encoding a c-myc tag that has been engineered to reflect the codon usage of C. albicans 
is fused to the recombinant protein. The tag is fused to the C terminus of the protein unless 
suspicions arise that predict important functions for the C terminus, or that predict that the C 
terminus is unexposed in the interior of the protein. In these cases, secondary structure 

30 predictions and predictions about conserved functional motifs are considered in positioning 
the c-myc tag in a location that is exposed on the surface of the protein for interaction with 
commercial anti-c-myc monoclonal antibodies. 
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The c-myc-tagged DNABPG is introduced into CAM by spheroplast co- 
transformation at its native locus. Co-transformation protocols targeting enolase disrupted 
with URA3 to the enolase locus may serve as selection for DNA uptake. Integration of the c- 
myc-tagged DNABPG at its chromosomal locus is performed by Southern blotting. 
5 A potential problem with the previous approach is that creation of the c-myc-tagged DNA 
might cause artifactual localization. Therefore analysis of the native protein may be required. 
Monoclonal antibodies to purified recombinant DNABPs are also be prepared, to analyze 
native, imtagged proteins. 

Additionally, the monoclonal antibodies are also used in super-shifting of bands in 
10 EMS A assays to verify that the cloned gene represents the protein in extracts that cause only 
mobility shifts. 

For localization experiments, yeasts and/or hyphae are formalin-fixed, followed by 
digestion of cell walls with Zymolyase and application to poly L-lysine coated microscope 
slides according to standard techniques. Pringle et al., 194 Guide to Yeast Genetics and 

15 Molecular Biology 565-601 (Guthrie and Fink, ed.) (1991). Cell membranes are 

permeabilized with methanol and acetone and cells are then be incubated with antiserum 
prepared to purified recombinant proteins, or commercial monoclonal antibodies to c-myc 
followed by affinity purified FITC-conjugated goat anti-mouse antibodies. As a positive 
control for nuclear localization, control cells are stained with DAPI, which binds to DNA. 

20 Negative controls consist of antibodies to enolase, a cytoplasmic protein which should not 
give nuclear staining and rhodamine-conjugated phalloidin which stains actin and also does 
not stain nuclei. These experiments verify that the DNABPs are localized in the nucleus and 
are present in the appropriate growth form. 



25 V. Role of DNA-binding proteins in expression of HWPl. 

To determine the function of the DNABPs in expression of HWPl as well as in the 
bud/hyphae transition, gene disruptions will be performed. The disruptions are performed 
using the standard URA-blaster protocol for C albicans (Fonzi & Irwin, 134 GENETICS 717- 
28 (1993)), which was used to disrupt HWPl, To ensure that the resulting phenotypes are 
30 truly associated with loss of the DNABP, revertant strains that have the DNA-binding protein 
gene replaced are also prepared. Co-transformation was used to obtain HWPl revertants in 
double knockout strains, and this approach is used to obtain revertants for these studies as 
well. If co-transformation is unsuccessful, constructs with the DNA-binding protein gene 
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adjacent to the selectable URA3 marker are prepared, which are used to transform ura- double 
knockout strains lacking the DNABPG. 

The role of the DNA-binding proteins in HWP] expression is determined using 
conditions during which HWPl is normally present. HWPl expression in sohd media is also 

5 assessed, since morphogenesis in non-liquid environments may be relevant to disease and 
because germ tube formation is sometimes enhanced when organisms are within or on top of 
agar or other non-Uquid media. Anderson et ak, 52(2) Infect. Immun. 458-67 (1989). 
HWPl can be detected on organisms excised from agar surfaces. For germ tube formation in 
solid media, yeasts are added to agar media (Ml 99 or Lee's media at neutral pH) at 45°C just 

10 prior to pouring as set forth in example 2. Plates are incubated overnight at 30°C, allowing 
pinpoint yeast colonies to form within the agar. Plates are then be transferred to 37°C for 
induction of germ tube formation. The extent of germ tube formation is visualized and 
photographed using a stereomicroscope equipped with an automatic camera. 

15 VI^ Role of DNA-binding proteins in adhesion of C albicans to human buccal epithelial 
cells. 

Null mutants for genes that activate HWPl expression are compared to isogenic 
strains with the gene in adhesion assays to determine if the loss of the DNABPG affects 
adhesion. If HWPl is not expressed, no stabiUzed adhesion is expected, since HWPl is the 
20 only gene encoding a transglutaminase substrate in C. albicans. It is possible that 

conventional adhesins may be co-regulated with HWPl and that the expression of the 
conventional adhesins maybe absent in the null mutant. Both stabilized and conventional 
adhesion assays are performed as previously described. Staab et al. (1999). 

25 Example 6: Identify global regulatory networks by isolation of genes that are co- 
regulated with HWPl using mini-array technology. 

Global regulatory circuits that lead to the production of hypha-specific genes are also 
identified. Many examples in fungi exist where global regulatory circuits affect multiple 
genes that are necessary for growth in specific environments. It is likely that C. albicans 
30 employs such a regulatory circuit for expressing multiple genes during hypha-specific growth 
conditions. The DNABPGs identified provide a valuable tool for probing the C albicans 
genome to identify global regulators. 
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I. Role of DNA binding proteins in controlling expression of known hypha-specific 
genes. 

Several genes other than HWPl have been reported to be hypha-specific. These are 
HYRl, ECEl, ALS3, CHS2, and SAP6. Bailey et al.; Birse et al.; Gow et al., 91(13) Proc. 
5 Natl. Acad. Sci, USA 6216-20 (1994); Hoyer et al.; White & Agabian, 177(18) J. 
Bacteriol. 5215-21 (1995). To determine if the DNABPG acts exclusively on HWPl 
upstream DNA, or if other hypha specific genes are subject to the same regulatory effects, 
Northern blotting is performed using RNA fi-om isogenic strains that have or do not have the 
DNABPG(s), Equal amounts of RNA fi-om the various isogenic strains that were grown in 
10 germ tube-inducing conditions is loaded on gels, and probes for ribosomal RNA are used as 
internal controls for equivalent loading. Probes, radiolabeled with [a-^^P] deoxycytidine by 
1 the random primer method, include the coding regions of the HWPl gene, the HYRl gene, 
J and other hypha specific genes. If the DNABPG is important for expression of a given 
; hypha-specific gene, either as an activator or as a repressor, the level of mRNA for the 
il5 hypha-specific gene is decreased or increased respectively in the null DNABPG mutant 
J compared to isogenic strains (including a complemented strain) harboring the DNABPG. 

The relative intensities of signals on the membranes is determined by analysis on a 
^ Phosphorlmager. 

The upstream regions of known hypha-specific genes is also be analyzed for the 
f20 presence of the UAS and URS elements that were found to be important. The presence of 
these elements and the Northern blot results using RNA firom strains having the DNABPGs 
are compared to determine the relationship between gene expression and presence of the 
regulatory elements. The data generated from these studies help determine the relationship 
between hypha-specific gene expression and the presence of putative DNABPGs. 

25 

II. Regulation study of the DNABPGs. 

It is possible that the DNABPG's isolated are produced in response to an factor that 
mediates the transcription of the identified genes, and that other layers of regulation are 
involved in expression of hypha-specific genes. If a developmentally DNABPG is identified 
30 then studies similar to the ones performed on the HWPl upstream region is performed on the 
upstream region of the DNABPBs. A reporter construct under the control of the new 
promoter region is constructed. UAS and URS elements in this DNA sequence are identified 
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with the goal of finding upstream effectors in the networks that regulate hypha-specific gene 
expression. 

III. Mini-arr ay-based, transcriptional analysis of a C albicans genomic library to 
5 identify and characterize genes under the control of DNA binding proteins. 

One may identify unknown target genes that are under the control of the DNABPGs 
isolated above. In the near future, this type of experiment will be performed using purchased 
chips or membranes spotted in known locations with oligonucleotides or PCR products 
representative of each open reading frame in the genome. Unfortunately, C albicans chips 

10 and membranes are not yet available; however, a 'Svork-around" method using primers that 
bind to standard E, coli vectors is proposed {see Figure 15). 

The proposed method is comparable to differential screening of a genomic library 
with cDNA (prepared from mRNA) probes obtained from cells growing under differing 
conditions. This new proposed method is predicted to be more sensitive than differential 

15 screening, and the extensive plaque purification and subcloning steps inherent in standard 
library screening protocols is not employed. The method is being used to study H, 
capsulatum genes and was originally proposed by J. DeRisi who has extensive experience 
with genome wide expression monitoring. DeRisi & Iyer 11 (1) CURR. Opin. Oncol. 76-9 
(1999). The method is diagrammed in Figure 15. 

20 The basic approach is to use RNA from the isogenic strains with or without the 

DNABPGs that were grown in germ tube-inducing conditions to prepare labeled cDNA. 
PolyA+ RNA are prepared from total RNA using an oligo dT column and are used to prepare 
^^P-labeled first strand cDNA to probe membranes spotted with C. albicans genomic 
fragments prepared as described below. 

25 The mini-array approach using PCR products spotted onto nylon membranes follows 

procedures described for S, cerevisiae. Cox et al. The use of nylon membranes rather than 
glass chips is chosen because of the greater ease of sample application and because of the 
ability to analyze the results using ImageQuant software and a Phosphorlmager. 

The membranes are prepared by robotically spotting PCR products amplified from 

30 genomic inserts, 0.5-2.0 kb in size, cloned in pBluescript using imiversal primers (see Figure 
14, arrows). The PCR reactions are performed using intact organisms from colonies that 
have been spotted from an agar plate containing the E. coli transformants harboring plasmids 
with genomic inserts to microtiter dishes. The presence of a PCR product in each reaction in 
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the microtiter dish is detennined by agarose gel electrophoresis, A BioMek 2000 workstation 
(Beckman Coulter, Fullerton, CA) capable of transferring portions of colonies to a microtiter 
dish for PGR reactions is used. The number of PGR products to be used is determined by our 
ability to obtain and assay for the presence of productive PGR products. With a dedicated 

5 thermocycler running full time, 1 92 reactions per day can be prepared and checked by 

electrophoresis, and within 3 months 1 1,520 reactions can occur. Given the insert sizes, this 
represents 0.7 coverage of the genome, over 5000 open reading frames based on an estimate 
of 8000 in the genome (B.B Magee, Gold Spring Harbor, Sept. 1999). The number of genes 
in regulatory networks of other fungi suggests that multiple genes are isolated using this 

10 strategy. 

A mini-array membrane spotted with PGR products is then be hybridized with 
radiolabeled cDNAs as described above. To avoid artifacts inherent in the use of different 
membranes and different spotting efficiencies, the same membrane is used for multiple 
labeled cDNA preparations by stripping the membrane between uses. The ability to strip 

1 5 with hot SDS solutions to remove at least 97.5% of the signal from individual spots and reuse 
nylon membranes is verified in control experiments using mini-arrays made from S, 
cerevisiae PGR products. Gox et al. To test that the membranes contain DNA that hybridize 
known genes, and to estimate the amount of the genome that is represented by the products 
on the membranes, hybridizations using ^^P-labeled DNA representing HWPl, ACTl, ENOl 

20 and other available genes is performed in control experiments. Additional control 

experiments consist of hybridization, stripping and rehybridization with the same probe, 
followed by Phosphorhnager analysis to determine the intensity ratios of the same spot in two 
different hybridizations with the same probe. The ratio should be close to 1 for all spots. 
Once membranes are judged suitable, radiolabeled cDNA from C albicans strains with or 

25 without the DNABPG described above are used in standard hybridization reactions. 

Intensities of signals at each spot is determined with ImageQuant software and ratios 
determined for each spot for the different labeled cDNA species. The intensity ratio of spots 
with HWPl DNA serves as a standard for the level of difference that would be expected for 
other genes regulated in a similar manner and to verify that the design successfiiUy identifies 

30 genes controlled by the DNABPG. The ratios of differences in intensities, and the direction of 
the difference, Le, increased or decreased for DNABPG null mutant cDNA compared to 
isogenic strains with the gene is tabulated. If the DNABPG is important for expression of a 
given hypha-specific gene, either as an activator or as a repressor, the level of mRNA for the 
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hypha-specific gene is decreased or increased respectively in the null DNABP mutant 
compared to isogenic strains (including a complemented strain) harboring the DNABPG. 
Genes like HWPI that are targets of an activating DNABPG are represented by the black spot 
on the array hybridized with labeled cDNA from strains having the DNABPG but not from 

5 strains lacking this gene. 

To further verify that the expression of DNA in the spots is controlled by the 
DNABPG(s), Northern blots are performed as stated above except that the PGR product used 
to produce the spot (newly identified target gene) is used to prepare the probe for Northern 
blotting comparing blots with RNA from the isogenic strains as described. 

1 0 The identity of the target genes controlled by the DNABPG is determined by 

retrieving the E. coli colony harboring the plasmid used to generate the PGR product that 
produced different signals with the differentially expressed cDNA. The DNA sequence of 
the plasmid insert is determined by dideoxynucleotide sequencing and used to search the C. 
albicans genome for the corresponding genes. 

1 5 The genes are analyzed by standard homology searches in order to gain insight into 

the functional category of the gene product, cell wall protein, metabohsm, signal 
transduction, secretion etc. Upsfream regions of these newly identified genes are analyzed 
for the presence of the UAS or URS, or of similar sequences that might control the strength 
of binding of the DNABP. The identity of the genes forms the bases of new hypotheses 

20 about pathogenic mechanisms that will be directly testable in proposed future experiments. 
These experiments have a very high probability of revealing a global regulatory network that 
C. albicans deploys to produce a set of genes that permit invasion and proliferation in host 
tissues. 

Mini and microarray approaches to pathogenic questions will be extremely important 
25 for advancing knowledge about C. albicans because of the present inability to employ 

random mutagenesis approaches with this diploid organism. Furthermore, the availability of 
membranes and/or chips for C. albicans is imminent. 

As an alternative approach to the mini-array scheme, differential screening of C. 
albicans cDNA libraries are performed. For identification of target proteins that are activated 
30 during germ tube induction, a lambda ZAP cDNA library prepared from germ tube mRNA is 
transferred to a nitrocellulose membrane that will be probed with "P-labeled first strand 
cDNA prepared from PolyA+ RNA from the isogenic strains with or without the DNABPGs 
that were grown in germ mbe-inducing conditions as described for the mini-arrayed 
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membranes above. For identification of target proteins that are repressed during yeast 
growth, a yeast-phase cDNA library is prepared using poly A+mRNA isolated from yeast 
forms groing at 37°C, the physiologically relevant temperature, in YNB. The library will be 
proped with ^^P-labeled first strand cDNA prepared from PolyA+ RNA from the isogenic 
5 strains with or without the DNABPGs that were grown as yeast's using the previously 
described conditions. 

cDNA clones that display differing signal strengths depending on whether the probing 
cDNA was prepared from strains with or without the DNABPG, as assessed by 
autoradiographic signal intensities, is plaque-purified. As for the mini-array experiments, 
1 0 intensities of signals of each clone are determined with ImageQuant software and ratios 
determined for each clone for the different labeled cDNA species. The intensity ratio of 
clones with HWP] DNA serves as a standard for the level of difference that would be 
expected for other genes regulated in a similar manner and to verify that the design 
successfully identifies genes controlled by the DNABPG. The ratios of differences in 
15 intensities, and the direction of the difference, i.e., increased or decreased for DNABPG null 
mutant cDNA compared to isogenic sfrains with the gene is tabulated. If the DNABPG is 
important for expression of a given hypha-specific gene, either as an activator or as a 
repressor, the level of mRNA for the hypha-specific gene is decreased or increased 
respectively in the null DNABP mutant compared to isogenic sfrains (including a 
20 complemented sfrain) harboring the DNABPG. 

Various modifications and variations of the described methods and systems of the 
invention will be apparent to those skilled in the art without departing from the scope and 
spirit of the invention. Although the invention has been described in connection with specific 
preferred embodiments, it should be understood that the invention as claimed should not be 
25 unduly limited to such specific embodiments, hideed, various modifications of the described 
modes for carrying out the invention which are obvious to those skilled in molecular biology 
or related fields are intended to be within the scope of the following claims. 

The disclosures of all references and publications cited above are expressly 
incorporated by reference in their entireties to the same extent as if each were incorporated by 
30 reference individually. 
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